I have a CSV file with the following contents:
"DepartmentID","Name","GroupName","ModifiedDate"
"1","Engineering","Research and Development","2008-04-30 00:00:00"
I created the following external table:
create external table if not exists AdventureWorks2014.Department
(
DepartmentID smallint ,
Name string ,
GroupName string,
rate_code string,
ModifiedDate timestamp
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '","' lines terminated by '\n'
STORED AS TEXTFILE LOCATION 'wasb:///ds/Department' TBLPROPERTIES('skip.header.line.count'='1');
Then I loaded the data:
LOAD DATA INPATH 'wasb:///ds/Department.csv' INTO TABLE AdventureWorks2014.Department;
No data was loaded.
select * from AdventureWorks2014.Department;
The query above returns no results.
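I think the double quotes around each field are the problem. Is there a way to load a file like this into a Hive table without having to strip the double quotes?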
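One possible approach, sketched here under assumptions rather than tested against this cluster, is to let Hive's bundled OpenCSVSerde (available in Hive 0.14 and later) handle the quoting instead of a multi-character field delimiter. The table name Department_csv is hypothetical, and the location is assumed to be the same directory as in the DDL above. OpenCSVSerde exposes every column as string, so the numeric and timestamp columns are cast at query time.

```sql
-- Sketch only: Department_csv is a hypothetical table name.
-- OpenCSVSerde reads every column as string, whatever type is declared.
CREATE EXTERNAL TABLE IF NOT EXISTS AdventureWorks2014.Department_csv
(
  DepartmentID string,
  Name         string,
  GroupName    string,
  ModifiedDate string
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'
WITH SERDEPROPERTIES (
  "separatorChar" = ",",   -- fields are separated by a plain comma
  "quoteChar"     = "\"",  -- surrounding double quotes are stripped on read
  "escapeChar"    = "\\"
)
STORED AS TEXTFILE
LOCATION 'wasb:///ds/Department'  -- assumed: same directory as the original table
TBLPROPERTIES ('skip.header.line.count'='1');

-- Cast to the intended types when querying.
SELECT CAST(DepartmentID AS smallint)  AS DepartmentID,
       Name,
       GroupName,
       CAST(ModifiedDate AS timestamp) AS ModifiedDate
FROM AdventureWorks2014.Department_csv;
```

The column list here has four entries to match the four fields in the sample file; the rate_code column from the original DDL has no counterpart in the CSV and is left out of this sketch.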
The column is of type smallint, so the value should not be in double quotes. It should be 1 instead of "1". - Venkata Dorisala