我正在处理一个表格中的两列。
+-------------+--------------------------------------------------------------+
| Area Name | Code Description |
+-------------+--------------------------------------------------------------+
| N Hollywood | VIOLATION OF RESTRAINING ORDER |
| N Hollywood | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| N Hollywood | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| N Hollywood | ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT |
| Southeast | ASSAULT WITH DEADLY WEAPON, AGGRAVATED ASSAULT |
| West Valley | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| West Valley | CRIMINAL THREATS - NO WEAPON DISPLAYED |
| 77th Street | RAPE, FORCIBLE |
| Foothill | CRM AGNST CHLD (13 OR UNDER) (14-15 & SUSP 10 YRS OLDER)0060 |
| N Hollywood | VANDALISM - FELONY ($400 & OVER, ALL CHURCH VANDALISMS) 0114 |
+-------------+--------------------------------------------------------------+
我正在使用Groupby和value_counts来查找区域名称下的代码描述。
df.groupby(['Area Name'])['Code Description'].value_counts()
有没有办法只查看每个区域名称的前n个值?如果我在上面的代码中附加
.nlargest(3)
,它只返回一个区域名称的结果。+---------------------------------------------------------------------------------+
| Wilshire SHOPLIFTING-GRAND THEFT ($950.01 & OVER) 7 |
+---------------------------------------------------------------------------------+