如何将嵌套列表作为新列添加到现有的Pandas数据框中

3

我已经创建了一个名为“df”的数据框,它看起来像这样:

   Name
0. School
1. Organisation 
2. Teacher 
3. Guest

现在我有三个列表。
1. A = ['','','',['12','4']]
2. B = ['','','',['3','8']]
3. status = ['','','','[['yes','no','yes'],['no','yes','no']]]
4. letter = ['', '', '', [[['K', 'L'], ['L'], ['L']], [['O'], ['P', 'O'], ['K']]]]

我想将这4个列表添加到我的现有数据框df中,它应该看起来像这样。
  Name            A1    A2   status  letter
0. School                 
1. Organisation          
2. Teacher       
3. Guest          12     3   yes     K
3. Guest          12     3   yes     L
3. Guest          12     3   no      L
3. Guest          12     3   yes     L
3. Guest           4     8   no      O
3. Guest           4     8   yes     P
3. Guest           4     8   yes     O
3. Guest           4     8   no      K

我已经尝试过 df['from']=fromdf['to']=to,但是这并没有给我期望中的表格。

我也尝试了以下方法:

dfa=pd.DataFrame({"Names":names})

def flat(nums):
    res = []
    index = []
    for i in range(len(nums)):
        if isinstance(nums[i], list):
            res.extend(nums[i])
            index.extend([i]*len(nums[i]))
        else:
            res.append(nums[i])
            index.append(i)
    return res,index

    x="A1"
    list_flat,list_index=flat(eval(x))
    dataframe = pd.DataFrame({x:list_flat},index=list_index)
    df = pd.concat([df['Names'],dataframe],axis=1,sort=False)

    x="A2"
    list_flat,list_index=flat(eval(x))
    dataframe = pd.DataFrame({x:list_flat},index=list_index)
    df= pd.concat([df,dataframe],axis=1,sort=False)

    x="status"
    list_flat,list_index=flat(eval(x))
    dataframe = pd.DataFrame({x:list_flat},index=list_index)
    df = pd.concat([df,dataframe],axis=1,sort=False)

    x="letter"
    list_flat,list_index=flat(eval(x))
    dataframe = pd.DataFrame({x:list_flat},index=list_index)
    df = pd.concat([df,dataframe],axis=1,sort=False)

但是,嵌套列表并没有像不同的行一样出现,而是像这样出现在同一行。
Name              A1    A2    status                 letter
0. School                 
1. Organisation          
2. Teacher       
3. Guest          12     3   ['yes','no','yes']     [['K','L'],['L'],['L']]
3. Guest           4     8   ['no','yes','no']      [['O'],['P','O'],['K']]

你能否将当前的输出结果添加到问题文本中?谢谢。 - Stefan Becker
1
为什么变量 letter 是二维的? - Jeril
@Jeril 这是一个 API 响应,它以二维形式呈现。 - Miffy
1个回答

4

您可以尝试以下操作:

name = ['School', 'Organization', 'Teacher', 'Guest']
A = ['','','',['12','4']]
B = ['','','',['3','8']]
status = ['','','',[['yes','no','yes'],['no','yes','no']]]
letter = [['', '', '', [[['K', 'L'], ['L'], ['L']], [['O'], ['P', 'O'], ['K']]]]]

final_list = []
for a, b, c, d, e in zip(name, A, B, status, letter[0]):
    if any([b, c, d, e]):
        for b1, c1, d1, e1 in zip(b, c, d, e):
            for d2, e2 in zip(d1, e1):
                for e3 in e2:
                    final_list.append([a, b1, c1, d2, e3])
    else:
        final_list.append([a, b, c, d, e])

df = pd.DataFrame(final_list, columns=['Name', 'A1', 'A2', 'status', 'letter'])

输出:

            Name  A1 A2 status letter
0         School                     
1   Organization                     
2        Teacher                     
3          Guest  12  3    yes      K
4          Guest  12  3    yes      L
5          Guest  12  3     no      L
6          Guest  12  3    yes      L
7          Guest   4  8     no      O
8          Guest   4  8    yes      P
9          Guest   4  8    yes      O
10         Guest   4  8     no      K

输出为空。 - Miffy
我能够获得输出,你尝试打印输出了吗? - Jeril
如果将letter [0]替换为letter,则它完全正常工作。非常感谢。 - Miffy

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接