I have a DataFrame like this:
    part part_parent
0  part1         NaN
1  part2       part1
2  part3       part2
3  part4       part3
4  part5       part2
I need to add an extra column, hierarchy, like this:
    part part_parent                hierarchy
0  part1         NaN                    part1
1  part2       part1             part1/part2/
2  part3       part2       part1/part2/part3/
3  part4       part3  part1/part2/part3/part4
4  part5       part2        part1/part2/part5
Dictionaries to create the input/output DataFrames:
import pandas as pd
from numpy import nan

# Input: each part and its direct parent (NaN marks the root)
df1 = pd.DataFrame({'part': {0: 'part1', 1: 'part2', 2: 'part3', 3: 'part4', 4: 'part5'},
                    'part_parent': {0: nan, 1: 'part1', 2: 'part2', 3: 'part3', 4: 'part2'}})

# Expected output: same data plus the hierarchy column
df2 = pd.DataFrame({'part': {0: 'part1', 1: 'part2', 2: 'part3', 3: 'part4', 4: 'part5'},
                    'part_parent': {0: nan, 1: 'part1', 2: 'part2', 3: 'part3', 4: 'part2'},
                    'hierarchy': {0: 'part1',
                                  1: 'part1/part2/',
                                  2: 'part1/part2/part3/',
                                  3: 'part1/part2/part3/part4',
                                  4: 'part1/part2/part5'}})
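Note: I have seen a few threads that use NetworkX to solve this kind of problem, but I could not get it to work for my case. Any help would be greatly appreciated.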
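
To make the rule behind the hierarchy column concrete: each value is the chain of ancestors from the root down to the part itself, joined with '/'. Below is a minimal sketch of that rule without NetworkX, assuming every part has at most one parent and the parent links contain no cycles. The names parent_of and build_path are hypothetical helpers introduced only for illustration, and the sketch does not reproduce the trailing slashes that appear on some rows of the expected output above.

```python
import pandas as pd
from numpy import nan

df1 = pd.DataFrame({
    'part': ['part1', 'part2', 'part3', 'part4', 'part5'],
    'part_parent': [nan, 'part1', 'part2', 'part3', 'part2'],
})

# Hypothetical helper mapping: part -> direct parent (NaN marks a root).
parent_of = dict(zip(df1['part'], df1['part_parent']))

def build_path(part):
    """Walk parent links up to the root, then join the chain root-first with '/'."""
    chain = []
    # Assumes no cycles; a cycle would make this loop run forever.
    while pd.notna(part):
        chain.append(part)
        part = parent_of.get(part, nan)
    return '/'.join(reversed(chain))

df1['hierarchy'] = df1['part'].map(build_path)
print(df1)
```

Each lookup follows the parent links one step at a time, so the cost per row is proportional to the depth of that part in the tree. For large or deeply nested data, a graph library might be a better fit.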