Datasets are needed for practicing and learning most big data technologies. Here are a few datasets that may be useful for Elasticsearch. They may be useful for other big data technologies as well.
Elastic.co Datasets
The following datasets are provided on the elastic.co website:
- The complete works of William Shakespeare, suitably parsed into fields. Download this data set by clicking here: shakespeare.json.
- A set of fictitious accounts with randomly generated data. Download this data set by clicking here: accounts.zip
- A set of randomly generated log files. Download this data set by clicking here: logs.jsonl.gz
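Once a file is downloaded and unpacked, it can be sent to Elasticsearch through the bulk API. The sketch below assumes the file is in the newline-delimited bulk format, where an action line is followed by a document line; check the file before relying on that. The node address `http://localhost:9200`, the helper names `pair_bulk_lines` and `post_bulk`, and the two sample lines are illustrative assumptions, not taken from the datasets themselves.

```python
import json
import urllib.request


def pair_bulk_lines(lines):
    """Pair each action line with the document line that follows it."""
    # Skip blank lines, then parse every remaining line as JSON.
    parsed = [json.loads(line) for line in lines if line.strip()]
    if len(parsed) % 2 != 0:
        raise ValueError("expected alternating action and document lines")
    return list(zip(parsed[0::2], parsed[1::2]))


def post_bulk(path, base_url="http://localhost:9200"):
    """Send a bulk-format file to the _bulk endpoint.

    base_url is an assumed local node; the file body must end with a newline.
    """
    with open(path, "rb") as f:
        body = f.read()
    request = urllib.request.Request(
        base_url + "/_bulk",
        data=body,
        headers={"Content-Type": "application/x-ndjson"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Made-up sample lines in the assumed bulk layout, used to check a file's
# structure without needing a running cluster.
sample = [
    '{"index": {"_index": "shakespeare", "_id": 0}}',
    '{"play_name": "Henry IV", "speaker": "KING HENRY IV"}',
]
pairs = pair_bulk_lines(sample)
print(pairs[0][1]["speaker"])
```

Running `pair_bulk_lines` over the first few lines of a downloaded file is a quick way to confirm its layout before calling `post_bulk`, which needs a reachable Elasticsearch node.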
GroupLens Datasets
GroupLens Research has collected and made available several datasets: grouplens.org/datasets
Learning Elastic Stack 6.0 Book
Product catalog data taken from amazon.com. The data is downloadable from http://dbs.uni-leipzig.de/file/Amazon-GoogleProducts.zip.
Data for aggregations (chapter 4) is available on GitHub: https://github.com/pranav-shukla/learningelasticstack.