'Indexing 80 million documents to elasticsearch from Django
I am using the latest version of django elasticsearch dsl and I am using the following command to index around 80 million documents:
python manage.py search_index --rebuild --models <model> --parallel
However, my system can't handle it and crashes at about 12gb of ram usage. CPU usage is 10% and seems to be fine.
Is there a way to index this amount of django entries to elasticsearch safely?
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|