Elasticsearch query to return all records

I have a small database in Elasticsearch and, for testing purposes, would like to pull all records back. I am attempting to use a URL of the form...

http://localhost:9200/foo/_search?pretty=true&q={'matchAll':{''}}

Can someone give me the URL you would use to accomplish this?

Current answer

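Apart from @Akira Sendoh, nobody has answered how to actually get ALL the documents. But even that solution crashed my ES 6.3 service without leaving any logs. The only thing that worked for me with the low-level elasticsearch-py library was the scan helper, which uses the scroll() API: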

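from elasticsearch import Elasticsearch
from elasticsearch.helpers import scan

# Assumed local node; adjust the address to your cluster
es_obj = Elasticsearch(["http://localhost:9200"])

doc_generator = scan(
    es_obj,
    query={"query": {"match_all": {}}},
    index="my-index",
)

# Iterate over the generator directly; don't build a list from it or you may run out of RAM
for doc in doc_generator:
    print(doc["_id"])  # replace with your own processing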

However, the cleaner way nowadays seems to be the elasticsearch-dsl library, which offers more abstract, more concise calls, e.g.: http://elasticsearch-dsl.readthedocs.io/en/latest/search_dsl.html#hits
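
For anyone curious what a scroll-based helper does, here is a rough sketch of the loop. It is not the library's actual implementation. The function name iter_all_hits, the page size and the keep-alive value are my own assumptions, and client is expected to be an elasticsearch-py client (newer client versions may prefer query= over body=).

```python
def iter_all_hits(client, index, page_size=1000, keep_alive="2m"):
    """Yield every hit in `index` by following a scroll cursor page by page.

    Hypothetical helper for illustration; `client` is assumed to expose
    search(), scroll() and clear_scroll() like an elasticsearch-py client.
    """
    resp = client.search(
        index=index,
        body={"query": {"match_all": {}}},
        scroll=keep_alive,  # how long the server keeps the cursor alive
        size=page_size,     # hits per page, not the total
    )
    scroll_id = resp["_scroll_id"]
    try:
        # An empty page means the cursor is exhausted
        while resp["hits"]["hits"]:
            yield from resp["hits"]["hits"]
            resp = client.scroll(scroll_id=scroll_id, scroll=keep_alive)
            scroll_id = resp["_scroll_id"]
    finally:
        # Release the server-side cursor even if the caller stops early
        client.clear_scroll(scroll_id=scroll_id)
```

Because this is a generator, only one page of hits is held in memory at a time, which is the same reason the scan helper should be iterated rather than turned into a list.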

2018-08-08 21:29:03

Other answers

Simple! You can use the size and from parameters:

http://localhost:9200/[your index name]/_search?size=1000&from=0

Then increase from step by step until you have retrieved all the data.
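
A minimal sketch of that stepping, in case it helps. It only builds the URLs and fetches nothing. The function name page_urls and the total of 2500 hits are made up for illustration; in practice you would read the total from the first response.

```python
from urllib.parse import urlencode


def page_urls(base_url, index, total_hits, size=1000):
    """Yield one _search URL per page, advancing `from` by `size` each time.

    Hypothetical helper; `total_hits` is assumed to be known already.
    """
    for offset in range(0, total_hits, size):
        query = urlencode({"size": size, "from": offset})
        yield f"{base_url}/{index}/_search?{query}"


# 2500 hits at 1000 per page gives three pages: from=0, 1000, 2000
urls = list(page_urls("http://localhost:9200", "foo", total_hits=2500))
for url in urls:
    print(url)
```

Keep in mind that from plus size is limited by the index.max_result_window setting (10,000 by default), so this only works for small indices.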

2015-12-14 10:29:43

A simple solution using the Python package elasticsearch-dsl:

from elasticsearch_dsl import Search
from elasticsearch_dsl import connections

connections.create_connection(hosts=['localhost'])

s = Search(index="foo")
response = s.scan()

count = 0
for hit in response:
    # print(hit.to_dict())  # be careful, it will printout every hit in your index
    count += 1

print(count)

See also https://elasticsearch-dsl.readthedocs.io/en/latest/api.html#elasticsearch_dsl.Search.scan.

2019-05-02 13:14:47

You can also use server:9200/_stats to get statistics about all your aliases, such as the size and number of documents per alias. It is very useful and informative.
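
As a rough illustration of reading that output, the sketch below pulls the document count and store size out of a _stats-style response. The sample dict is made up and heavily trimmed, and summarize_stats is a hypothetical name; a real response contains many more fields.

```python
def summarize_stats(stats):
    """Map each index name to (document count, store size in bytes) of its primary shards."""
    return {
        name: (
            data["primaries"]["docs"]["count"],
            data["primaries"]["store"]["size_in_bytes"],
        )
        for name, data in stats["indices"].items()
    }


# Made-up sample shaped like a trimmed _stats response
sample = {
    "indices": {
        "foo": {"primaries": {"docs": {"count": 42}, "store": {"size_in_bytes": 10240}}},
        "bar": {"primaries": {"docs": {"count": 7}, "store": {"size_in_bytes": 2048}}},
    }
}

summary = summarize_stats(sample)
for name, (count, size_bytes) in summary.items():
    print(f"{name}: {count} docs, {size_bytes} bytes")
```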

2014-08-18 13:21:16

The best way to adjust the size is to add size=number to the URL's query string:

curl -XGET "http://localhost:9200/logstash-*/_search?size=50&pretty"

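Note: the maximum value that can be set for size is 10000. For anything above 10,000, Elasticsearch expects you to use the scroll API, which minimizes the performance impact.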

2016-08-10 13:11:25

By default Elasticsearch returns 10 records, so the size should be provided explicitly.

Add size to the request to get the desired number of records.

http://{host}:9200/{index_name}/_search?pretty=true&size=(number of records)

Note: the page size cannot exceed the index.max_result_window index setting, which defaults to 10,000.
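
To make the limit concrete, here is a small sketch of the check, assuming the default window of 10,000. The names MAX_RESULT_WINDOW and fits_in_result_window are my own. The limit applies to from plus size together, not to size alone.

```python
MAX_RESULT_WINDOW = 10_000  # default value of index.max_result_window


def fits_in_result_window(offset, size, window=MAX_RESULT_WINDOW):
    """Return True if a from/size request stays within the result window."""
    return offset + size <= window


print(fits_in_result_window(0, 10_000))  # the whole window in one page
print(fits_in_result_window(9_990, 20))  # runs past the window
```

When the check fails, switch to the scroll API (or raise the index setting, at the cost of more memory per request).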

2018-09-28 23:59:17
