我有一个表格,其中有一列名为id_list,其中包含一组字符串,例如["1","2","4","6"]
。
原始表格如下:
+-------------------+
| id_list |
+-------------------+
| ["1","2","4","6"] |
|-------------------+
| ["1","2","4","6"] |
|-------------------+
| ["1","2","4","6"] |
|-------------------+
| ["1","2","4","6"] |
|-------------------+
| ["7"] |
|-------------------+
我通过将id_list列进行拆分,创建了以下表格:
SELECT id, id_list FROM data LATERAL VIEW explode(id_list) dummy AS id;
+-------------+-------------------+
| id | id_list |
+-------------+-------------------+
| 1 | ["1","2","4","6"] |
+-------------|-------------------+
| 2 | ["1","2","4","6"] |
+-------------|-------------------+
| 4 | ["1","2","4","6"] |
+-------------|-------------------+
| 6 | ["1","2","4","6"] |
+-------------|-------------------+
| 7 | ["7"] |
+-------------|-------------------+
我想从id_list中移除id,以便得到如下表格:
+-------------+-------------------+
| id | id_list |
+-------------+-------------------+
| 1 | ["2","4","6"] |
+-------------|-------------------+
| 2 | ["1","4","6"] |
+-------------|-------------------+
| 4 | ["1","2","6"] |
+-------------|-------------------+
| 6 | ["1","2","4"] |
+-------------|-------------------+
| 7 | [] |
+-------------|-------------------+
如何在不使用UDF的情况下完成这个任务?