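I ran into a small problem while preparing for a data science olympiad. All I did was take a column whose values range from 2 to 8, convert those values to "good" or "bad" with a binning method, and then use a label encoder to turn them into 1 or 0.
When I run the following code:
I get the following error: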
Traceback (most recent call last):
  File "main.py", line 21, in <module>
    X = data.drop(data["quality"], axis=1)
  File "/home/runner/.local/share/virtualenvs/python3/lib/python3.8/site-packages/pandas/core/frame.py", line 3990, in drop
    return super().drop(
  File "/home/runner/.local/share/virtualenvs/python3/lib/python3.8/site-packages/pandas/core/generic.py", line 3936, in drop
    obj = obj._drop_axis(labels, axis, level=level, errors=errors)
  File "/home/runner/.local/share/virtualenvs/python3/lib/python3.8/site-packages/pandas/core/generic.py", line 3970, in _drop_axis
    new_axis = axis.drop(labels, errors=errors)
  File "/home/runner/.local/share/virtualenvs/python3/lib/python3.8/site-packages/pandas/core/indexes/base.py", line 5018, in drop
    raise KeyError(f"{labels[mask]} not found in axis")
KeyError: '[0 0 0 ... 1 0 1] not found in axis'
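The error says the values from my column were not found in the axis, but I already specified axis=1, so shouldn't the column be dropped?
Answer:
There is a bug in your Python code: drop expects column labels (a single name or a list of names), not the column itself. When you pass data["quality"], pandas treats the values in that column (the 0s and 1s) as labels and looks for them among the column names, which is why the KeyError lists them. The following should work: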
# Create our feature and target sets
y = data["quality"]
X = data.drop(["quality"], axis=1)
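Also, copy the column into y before dropping it whenever the drop modifies data itself (inplace=True, or reassigning the result back to data); otherwise data["quality"] raises a KeyError because the 'quality' column is already gone. As written above, drop returns a new DataFrame and leaves data unchanged, so the order is not strictly required here, but extracting y first is the safer habit.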
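To make the difference concrete, here is a minimal sketch that reproduces the error and the fix on a toy DataFrame. The data and the "alcohol" and "pH" column names are made up for illustration, since the original dataset is not shown; only "quality" comes from the question.

```python
import pandas as pd

# Hypothetical toy data; only the "quality" column name comes from the question.
data = pd.DataFrame({
    "alcohol": [9.4, 9.8, 10.5, 11.2],
    "pH": [3.51, 3.20, 3.26, 3.16],
    "quality": [0, 0, 1, 1],  # assumed to be already binned and label-encoded
})

# Passing the Series makes pandas treat its values (0 and 1) as column labels,
# and no column is named 0 or 1, so a KeyError is raised.
try:
    data.drop(data["quality"], axis=1)
    error_message = None
except KeyError as exc:
    error_message = str(exc)

# Passing the column name drops that column and returns a new DataFrame.
y = data["quality"]
X = data.drop(columns=["quality"])  # same as data.drop(["quality"], axis=1)

print(error_message)
print(list(X.columns))
print(list(data.columns))  # data itself still contains "quality"
```

The columns= keyword is just an alternative spelling of axis=1 that makes the intent a bit harder to misread.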