请问有没有人能够建议如何在将文档插入MongoDB集合时处理文档大小超过16MB的错误。我得到了一些解决方案,例如使用GridFS。通过使用GridFS可以解决此问题,但是我需要不使用GridFS的解决方案。是否有办法使文档变小或分割成子文档。如果有,我们怎么实现呢?
from pymongo import MongoClient
conn = MongoClient("mongodb://sample_mongo:27017")
db_conn = conn["test"]
db_collection = db_conn["sample"]
# the size of record is 23MB
record = { \
"name": "drugs",
"collection_id": 23,
"timestamp": 1515065002,
"tokens": [], # contains list of strings
"tokens_missing": [], # contains list of strings
"token_mapping": {} # Dictionary contains transformed tokens
}
db_collection.insert(record, check_keys=False)
我遇到了错误DocumentTooLarge: BSON document too large。在MongoDB中,最大的BSON文档大小为16兆字节。
File "/usr/local/lib/python2.7/dist-packages/pymongo-3.5.1-py2.7-linux-x86_64.egg/pymongo/collection.py", line 2501, in insert
check_keys, manipulate, write_concern)
File "/usr/local/lib/python2.7/dist-packages/pymongo-3.5.1-py2.7-linux-x86_64.egg/pymongo/collection.py", line 575, in _insert
check_keys, manipulate, write_concern, op_id, bypass_doc_val)
File "/usr/local/lib/python2.7/dist-packages/pymongo-3.5.1-py2.7-linux-x86_64.egg/pymongo/collection.py", line 556, in _insert_one
check_keys=check_keys)
File "/usr/local/lib/python2.7/dist-packages/pymongo-3.5.1-py2.7-linux-x86_64.egg/pymongo/pool.py", line 482, in command
self._raise_connection_failure(error)
File "/usr/local/lib/python2.7/dist-packages/pymongo-3.5.1-py2.7-linux-x86_64.egg/pymongo/pool.py", line 610, in _raise_connection_failure
raise error
DocumentTooLarge: BSON document too large (22451007 bytes) - the connected server supports BSON document sizes up to 16793598 bytes.
Stack Overflow
,请在提问时更具体一些:你用什么代码尝试过?(我投反对票是因为没有代码) / 你期望得到什么结果? / 你遇到了什么错误? **需要帮助请查看 "如何提问"**。 - Hille