Python Pandas - 指示除第一次出现之外的重复索引值
要指示除第一次出现之外的重复索引值,请使用.首先使用带有值的keep参数。index.duplicated()
首先,导入所需的库-
import pandas as pd
创建具有一些重复项的索引-
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])
显示索引-
print("Pandas Index with duplicates...\n",index)
将重复的索引值指示为True,但第一次出现除外。将“keep”参数设置为“first”-
print("\nIndicating duplicate values except the first occurrence...\n", index.duplicated(keep='first'))
示例
以下是代码-
import pandas as pd #Creatingtheindexwithsomeduplicates index = pd.Index(['Car','Bike','Airplane','Ship','Airplane']) #Displaytheindex print("Pandas Index with duplicates...\n",index) #Returnthedtypeofthedata print("\nThe dtype object...\n",index.dtype) #getthedimensionsofthedata print("\nGet the dimensions...\n",index.ndim) #IndicateduplicateindexvaluesasTrue,exceptthefirstoccurrence # Set the "keep" 参数为 "first" print("\nIndicating duplicate values except the first occurrence...\n", index.duplicated(keep='first'))输出结果
这将产生以下代码-
Pandas Index with duplicates... Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object') The dtype object... object Get the dimensions... 1 Indicating duplicate values except the first occurrence... [False False False False True]