Exception when indexing a document in Elasticsearch

3
I have a JSON document. When I try to index it in Elasticsearch, I get an exception.
index1 has no default mapping.
curl -XPOST localhost:9200/index1/talk?pretty=1 -d '
{
    "_id" : ObjectId("503b29efe4b032e338f0581b"),
    "_oid" : NumberLong(1182053),
    "_ugc" : false,
    "_v" : 22,
    "c" : [
        "Destination"
    ],
    "cc" : "AD",
    "co" : "andorra",
    "e" : true,
    "f" : [
        "Destination"
    ],
    "gi" : "3038999",
    "h" : 0,
    "i" : [ ],
    "k" : [
        "soldeu",
        "parroquia de canillo"
    ],
    "kv" : [
        "soldeu"
    ],
    "la" : 42.57688,
    "lc" : 0,
    "ln" : 1.66769,
    "ns" : [
        {
            "n" : "Soldeu",
            "l" : "en",
            "t" : "p"
        }
    ],
    "po" : 0,
    "point" : [
        42.57688,
        1.66769
    ]
}'

Stack trace:

org.elasticsearch.index.mapper.MapperParsingException: Failed to parse
    at org.elasticsearch.index.mapper.DocumentMapper.parse(DocumentMapper.java:509)
    at org.elasticsearch.index.mapper.DocumentMapper.parse(DocumentMapper.java:438)
    at org.elasticsearch.index.shard.service.InternalIndexShard.prepareCreate(InternalIndexShard.java:287)
    at org.elasticsearch.action.index.TransportIndexAction.shardOperationOnPrimary(TransportIndexAction.java:210)
    at org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction.performOnPrimary(TransportShardReplicationOperationAction.java:532)
    at org.elasticsearch.action.support.replication.TransportShardReplicationOperationAction$AsyncShardOperationAction$1.run(TransportShardReplicationOperationAction.java:430)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
    at java.lang.Thread.run(Thread.java:662)
Caused by: org.elasticsearch.common.jackson.core.JsonParseException: Unexpected character ('O' (code 79)): expected a valid value (number, String, array, object, 'true', 'false' or 'null')
 at [Source: [B@5e7d093a; line: 4, column: 10]
    at org.elasticsearch.common.jackson.core.JsonParser._constructError(JsonParser.java:1284)
    at org.elasticsearch.common.jackson.core.base.ParserMinimalBase._reportError(ParserMinimalBase.java:588)
    at org.elasticsearch.common.jackson.core.base.ParserMinimalBase._reportUnexpectedChar(ParserMinimalBase.java:509)
    at org.elasticsearch.common.jackson.core.json.UTF8StreamJsonParser._handleUnexpectedValue(UTF8StreamJsonParser.java:2094)
    at org.elasticsearch.common.jackson.core.json.UTF8StreamJsonParser.nextToken(UTF8StreamJsonParser.java:561)
    at org.elasticsearch.common.xcontent.json.JsonXContentParser.nextToken(JsonXContentParser.java:48)
    at org.elasticsearch.index.mapper.object.ObjectMapper.parse(ObjectMapper.java:461)
    at org.elasticsearch.index.mapper.DocumentMapper.parse(DocumentMapper.java:494)
    ... 8 more

The JSON is a document from MongoDB. I have installed the following plugins:

ES_HOME/bin/plugin -install elasticsearch/elasticsearch-mapper-attachments/1.4.0 
ES_HOME/bin/plugin -install richardwilly98/elasticsearch-river-mongodb/1.4.0 

Can anyone tell me what I am doing wrong?

Update:

The error appears to be caused by ObjectId() and NumberLong(). However, I do not want to index these fields, so I defined a custom mapping to omit them.

Custom mapping:

curl -XPUT localhost:9200/index1?pretty=1 -d '{
    "mappings" : {
        "type1" : {
            "_all" : {"enabled" : false},
            "properties" : {
                "ns" : {
                    "dynamic" : "true",
                    "properties" : {
                        "n" : {
                            "type" : "string"
                        },
                        "l" : {
                            "type" : "string"
                        },
                        "t" : {
                            "type" : "string"
                        }
                    }
                }
            }
        }
    }
}'

Ideally, the analyzer should omit _id and _oid, but is there also a way to provide a mapping for such objects? ObjectId = org.bson.types.ObjectId, NumberLong = java.lang.Double

2 Answers

1

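The JSON object is not valid.

There seems to be something odd about your _id property (ObjectId(...) is not a JSON value), and Elasticsearch cannot parse it.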


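_id is an ObjectId field, and similarly _oid is a NumberLong field. How do I map fields like these? - Rahul
I'm not sure I follow, but you can't do it that way. I would guess it should be written without those wrappers: "_id":"503b29efe4b032e338f0581b","_oid":1182053. - Marcus Granström
No, that is how the document is structured. I just need to know how to map a field of type Object. So far I have only seen mappings for primitive data types (int, float, string, and so on). - Rahul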
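
A minimal sketch of the suggestion in the comment above, assuming the document was copied from the mongo shell: it replaces the ObjectId(...) and NumberLong(...) wrappers with plain JSON values before the document is sent to Elasticsearch. The function name mongo_shell_to_json and the shortened sample document are made up for this illustration, and the regular expressions are a simplification that would also rewrite matching text inside string values.

```python
import json
import re


def mongo_shell_to_json(text: str) -> str:
    """Replace mongo-shell-only constructors with plain JSON values."""
    # ObjectId("503b...") -> "503b..."
    text = re.sub(r'ObjectId\(\s*"([0-9a-fA-F]{24})"\s*\)', r'"\1"', text)
    # NumberLong(1182053) or NumberLong("1182053") -> 1182053
    text = re.sub(r'NumberLong\(\s*"?(-?\d+)"?\s*\)', r'\1', text)
    return text


# Shortened version of the document from the question.
raw = '''{
    "_id" : ObjectId("503b29efe4b032e338f0581b"),
    "_oid" : NumberLong(1182053),
    "cc" : "AD"
}'''

doc = json.loads(mongo_shell_to_json(raw))

# Take "_id" out of the body so it can be passed in the request URL instead.
doc_id = doc.pop("_id")

print(doc_id)
print(json.dumps(doc))
```

The printed body is valid JSON and can be passed to curl with -d, with the ID placed in the URL (for example localhost:9200/index1/talk/<id>) rather than in the document itself.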

0

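To remove fields from a MongoDB document before it is indexed, you need to use a script:

  1. Install the JavaScript plugin: ES_HOME\bin\plugin -install elasticsearch/elasticsearch-lang-javascript/1.2.0
  2. Add a script property to the river settings: delete ctx.document._id;

Fields cannot be removed with a custom mapping.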
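
A rough sketch of how step 2 might look, written as Python that builds the river settings and prints the matching curl command. The placement of "script" inside the "mongodb" section follows my recollection of the river plugin's documentation and should be checked against the installed version. The river name in the URL and the database and collection names are hypothetical, since the question does not give them.

```python
import json

# River settings with a script that drops a field before indexing.
river_settings = {
    "type": "mongodb",
    "mongodb": {
        "db": "mydb",            # hypothetical database name
        "collection": "places",  # hypothetical collection name
        # Assumed location of the script property; verify for your plugin version.
        "script": "delete ctx.document._id;",
    },
    "index": {
        "name": "index1",  # index name from the question
        "type": "talk",    # type name from the question
    },
}

body = json.dumps(river_settings, indent=2)

# "mongodb" in the URL is a hypothetical river name.
print("curl -XPUT localhost:9200/_river/mongodb/_meta -d '" + body + "'")
```

Further statements can be appended to the same script string, for example another delete for _oid.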

