Convert Excel to JSON Schema in JavaScript

4
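I have a task to generate forms from an Excel sheet, where I have to design each form according to the data types provided. For example:
(image of the Excel sheet, not shown) I am trying to build a JSON schema from the Excel data above so that I can insert it into MongoDB and generate the forms dynamically.
Below is the code I am trying to implement: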
            var workbook = XLSX.readFile(req.file.path);
            //console.log(workbook);
            var result = {};
            workbook.SheetNames.forEach(function (sheetName) {
                var roa = XLSX.utils.sheet_to_row_object_array(workbook.Sheets[sheetName]);
                if (roa.length > 0) {
                    result = roa;
                }
            });
            //return result;
            //console.log(result);

            var jsonData = {};
            var dropdown = {};
            var attrTypes = result[0];
            //console.log(attrTypes);

            for (var i = 1; i < result.length; i++) {
                var obj = result[i];
                //console.log(obj);
                for (var key in obj) {
                    var attrName = key;
                    var attrValue = obj[key];
                    if (attrTypes[attrName]) {
                        var type = attrTypes[attrName].toLowerCase().replace(/ /g, ''); // Means type is given                        
                        //console.log(type);

                        if (type === "selectbox") {
                            console.log(attrValue);
                            //var dropdown = attrValue;
                            //console.log(dropdown);
                        }

                    } else {
                        //console.log(type); // Means type is not given
                        jsonData = attrName + ":" + attrValue;
                        //console.log(jsonData);
                    }
                }
            }

Expected JSON output:

[
{
    Number : 1,
    FirstName : "Abc",
    LastName : "Xyza",
    Dept: ['Finance','Health','Insurance'],
    Country : ['US','Australia','Canada'],
    Year : ['2014','2015','2016'],
    DateofBirth : new Date(1937,05,02),
    Gender : ['M','F']    
},
{
    Number : 2,
    FirstName : "Abcd",
    LastName : "Xyzb",
    Dept: ['Finance','Health','Insurance'],
    Country : ['US','Australia','Canada'],
    Year : ['2014','2015','2016'],
    DateofBirth : new Date(1948,10,27),
    Gender : ['M','F']    
}
        .
        .
        and so on
]

The above is the code I am trying to implement in the MEAN stack.
Any help would be appreciated.

4
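Please don't paste a big block of code and ask us to debug it. What have you tried? How does the actual result differ from the expected one? Is there an error message? See how to ask here: http://stackoverflow.com/help/how-to-ask - Juan Mendes
@JuanMendes: OK, let me update the post. - J.K.A.
Is the expected value of 'Dept' an array? Shouldn't it be the value of the dropdown? I would suggest converting the sheet to a regular 2D array of strings first, and then creating the objects. - RainingChain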
1
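What is the error/problem? The more information you provide in the question, the faster it will be answered. - Tyler
@Mark, you have a bunch of console.log calls. Can you give us the output of those logs? Please add a prefix, for example console.log("attrValue="+attrValue); - Dominique Fortin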
3 Answers

1
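You can use the JS-XLSX library to read XLSX and other Excel file formats on the client side.
You do not need to store the dropdown options, or any other values used to populate inputs, in each record. They should be stored separately. So the array of objects should look like this: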
[   
    {  
      "Number":1,
      "FirstName":"Abc",
      "LastName":"Xyza",
      "Dept":"Finance",
      "Country":"US",
      "Year":2014,
      "DateOfBirth":19370502,
      "Gender":"M"
    },
    {  
      "Number":2,
      "FirstName":"Abcd",
      "LastName":"Xyzb",
      "Dept":"Health",
      "Country":"Australia",
      "Year":2014,
      "DateOfBirth":19481027,
      "Gender":"F"
    }
]

The values for dropdowns and radio buttons should be stored separately, like this:
{  
   "Dept":{  
      "type":"dropdown",
      "values":[  
         "Finance",
         "Health",
         "Insurance"
      ]
   },
   "Country":{  
      "type":"dropdown",
      "values":[  
         "US",
         "Australia",
         "Canada"
      ]
   },
   "Year":{  
      "type":"dropdown",
      "values":[  
         2014,
         2015,
         2016
      ]
   },
   "Gender":{  
      "type":"radio button",
      "values":[  
         "M",
         "F"
      ]
   }
}
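
A minimal sketch of how the lookup object above might be derived, assuming the rows have already been parsed into plain objects (for example with `XLSX.utils.sheet_to_json`) and that one row maps each column to its control type, as the question's code assumes for `result[0]`. The function name `collectInputs` and the sample type labels are hypothetical.

```javascript
// Hypothetical helper: builds the "inputs" lookup from parsed sheet rows.
// Assumption: typeRow maps column name -> control type (e.g. "dropdown"),
// and dataRows are plain objects keyed by column name.
function collectInputs(typeRow, dataRows) {
  const inputs = {};
  for (const [column, type] of Object.entries(typeRow)) {
    const values = [];
    for (const row of dataRows) {
      const value = row[column];
      // Keep each distinct, non-empty value once, in first-seen order.
      if (value !== undefined && value !== "" && !values.includes(value)) {
        values.push(value);
      }
    }
    inputs[column] = { type: type, values: values };
  }
  return inputs;
}

const typeRow = { Dept: "dropdown", Gender: "radio button" };
const dataRows = [
  { Number: 1, FirstName: "Abc", Dept: "Finance", Gender: "M" },
  { Number: 2, FirstName: "Abcd", Dept: "Health", Gender: "F" },
  { Number: 3, FirstName: "Abce", Dept: "Health", Gender: "F" },
];
const inputs = collectInputs(typeRow, dataRows);
console.log(JSON.stringify(inputs, null, 2));
```

Columns without an entry in the type row (such as Number and FirstName here) are left out, so only the fields that need a fixed list of choices end up in the lookup.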

These two can be combined into a single schema object, as follows:

//included single objects from both for brevity
jsonSchema = {
    array: [
        {  
            "Number":2,
            "FirstName":"Abcd",
            "LastName":"Xyzb",
            "Dept":"Health",
            "Country":"Australia",
            "Year":2014,
            "DateOfBirth":19481027,
            "Gender":"F"
        }
    ],
    inputs: {
        "Gender":{  
            "type":"radio button",
            "values":[  
                "M",
                "F"
            ]
        }
    }
};

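Note: When data is serialized to JSON, date values cannot be stored as Date objects. They should be stored as strings or numbers and converted to Date objects on the client.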
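
As a rough illustration of that conversion, the sketch below turns a yyyyMMdd number such as 19370502 into a Date on the client. The function name `parseYyyyMmDd` is hypothetical, and the numeric format is assumed from the sample data in this answer.

```javascript
// Hypothetical helper: converts a yyyyMMdd number (e.g. 19370502) to a Date.
function parseYyyyMmDd(value) {
  const year = Math.floor(value / 10000);
  const month = Math.floor(value / 100) % 100; // 1-based month
  const day = value % 100;
  // The Date constructor expects a 0-based month, hence month - 1.
  return new Date(year, month - 1, day);
}

const dob = parseYyyyMmDd(19370502);
console.log(dob.getFullYear(), dob.getMonth() + 1, dob.getDate());
```

The resulting Date is in the browser's local time zone; storing an ISO string instead would be another reasonable choice.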
I have implemented the JSON generation and the form generation in this Git project.

https://github.com/ConsciousObserver/stackoverflow/tree/master/excelTest

Below is a screenshot of the output.

Generated Forms

Here is the output JSON.
{
  "array": [
    {
      "Number": 1,
      "FirstName": "Abc",
      "LastName": "Xyza",
      "Dept": "Finance",
      "Country": "US",
      "Year": 2014,
      "DateOfBirth": 19370502,
      "Gender": "M"
    },
    {
      "Number": 2,
      "FirstName": "Abcd",
      "LastName": "Xyzb",
      "Dept": "Health",
      "Country": "Australia",
      "Year": 2014,
      "DateOfBirth": 19481027,
      "Gender": "F"
    },
    {
      "Number": 3,
      "FirstName": "Abce",
      "LastName": "Xyzc",
      "Dept": "Health",
      "Country": "US",
      "Year": 2015,
      "DateOfBirth": 19441029,
      "Gender": "F"
    },
    {
      "Number": 4,
      "FirstName": "Abcf",
      "LastName": "Xyzd",
      "Dept": "Insurance",
      "Country": "Canada",
      "Year": 2016,
      "DateOfBirth": 19481030,
      "Gender": "M"
    },
    {
      "Number": 5,
      "FirstName": "Abcg",
      "LastName": "Xyze",
      "Dept": "Finance",
      "Country": "Canada",
      "Year": 2016,
      "DateOfBirth": 19480604,
      "Gender": "M"
    }
  ],
  "inputs": {
    "Dept": {
      "type": "dropdown",
      "values": [
        "Finance",
        "Health",
        "Insurance"
      ]
    },
    "Country": {
      "type": "dropdown",
      "values": [
        "US",
        "Australia",
        "Canada"
      ]
    },
    "Year": {
      "type": "dropdown",
      "values": [
        2014,
        2015,
        2016
      ]
    },
    "Gender": {
      "type": "radio button",
      "values": [
        "M",
        "F"
      ]
    }
  }
}

@11thdimention: Thanks. - J.K.A.

0

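I think you may be confusing schema with data. "Abc Xyza works in the Finance department" is data. "The possible values of Department given in the Excel file are Finance, Health, or Insurance" is schema.

Here is an example: http://json-schema.org/examples.html

If the title of the question is correct and you need a JSON schema, I would go for functions that create string arrays from the provided values for the "dropdown" or "radio button" types, determine the data type of the values in each column (Number is an int, FirstName is a string, and so on), and determine minimum and maximum values or even allowed string patterns.

The output I foresee looks like this: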

{
    "id" : "http://your.site/form-schema",
    "title" : "Form schema",
    "description" : "JSON schema for autogenerating forms",
    "type" : "object",
    "properties" : {
        "Number" : {
            "type" : "integer"
        },
        "FirstName" : {
            "type" : "string"
        },
        "LastName" : {
            "type" : "string"
        },
        "Dept" : {
            "type" : "string",
            "oneOf" : [
                        { "format" : "Finance"},
                        { "format" : "Health" },
                        { "format" : "Insurance" }
            ]
        },
        "Country" : {
            "type" : "string",
            "oneOf" : [
                        {"format" : "US" },
                        { "format" : "Australia" },
                        { "format" : "Canada" }
            ]
        },
        "Year" : {
            "type" : "integer",
            "oneOf" : [
                        { "format" : "2014" },
                        { "format" : "2015" },
                        { "format" : "2016" }
            ]
        },
        "DateofBirth" : {
            "type" : "string",
            "pattern" : "yyyyMMdd"
        },
        "Gender" : {
            "enum" : ["M", "F"]
        }
    },
    "required" : ["Number", "FirstName", "LastName"],
    "additionalProperties" : false
}
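
For comparison, here is a hedged sketch of how such a schema might be generated from parsed rows. It uses `enum` for the fixed value lists, since that is the JSON Schema keyword for restricting a property to a set of values. The function name `buildJsonSchema`, the `choiceColumns` parameter, and the type inference rule are assumptions made for illustration.

```javascript
// Hypothetical helper: infers a JSON Schema from parsed sheet rows.
// Assumptions: rows are plain objects keyed by column name, and
// choiceColumns lists the columns rendered as dropdowns or radio buttons.
function buildJsonSchema(rows, choiceColumns) {
  const properties = {};
  for (const column of Object.keys(rows[0])) {
    const values = rows.map((row) => row[column]);
    // A column is "integer" only if every value is an integer.
    const type = values.every((v) => Number.isInteger(v)) ? "integer" : "string";
    const property = { type: type };
    if (choiceColumns.includes(column)) {
      // Restrict the property to the distinct values seen in the sheet.
      property.enum = [...new Set(values)];
    }
    properties[column] = property;
  }
  return { type: "object", properties: properties };
}

const rows = [
  { Number: 1, FirstName: "Abc", Dept: "Finance", Year: 2014 },
  { Number: 2, FirstName: "Abcd", Dept: "Health", Year: 2014 },
  { Number: 3, FirstName: "Abce", Dept: "Health", Year: 2015 },
];
const schema = buildJsonSchema(rows, ["Dept", "Year"]);
console.log(JSON.stringify(schema, null, 2));
```

The `pattern` keyword expects a regular expression, so a yyyyMMdd string would need something like `^[0-9]{8}$` rather than the format name itself.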

0

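This project can generate JSON from an XLSX file. Take a look at that code.

It is written in Java. It uses apache.poi to parse the XLSX file and mongodb.bson to generate the JSON. Maybe it will give you some useful ideas.

This project and this project are both written in JavaScript. If you search on GitHub, you may find some useful code.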

