Numpy isnan（）は、floatの配列で失敗します（pandasデータフレームの適用から）

Question 1

私はパンダのデータフレームの適用から出てくるフロートの配列（いくつかの通常の数、いくつかのナン）を持っています。

何らかの理由で、この配列でnumpy.isnanが失敗していますが、以下に示すように、各要素は浮動小数点数であり、numpy.isnanは各要素で正しく実行され、変数の型は間違いなくnumpy配列です。

どうしたの？！

set([type(x) for x in tester])
Out[59]: {float}

tester
Out[60]: 
array([-0.7000000000000001, nan, nan, nan, nan, nan, nan, nan, nan, nan,
   nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
   nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
   nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan, nan,
   nan, nan], dtype=object)

set([type(x) for x in tester])
Out[61]: {float}

np.isnan(tester)
Traceback (most recent call last):

File "<ipython-input-62-e3638605b43c>", line 1, in <module>
np.isnan(tester)

TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

set([np.isnan(x) for x in tester])
Out[65]: {False, True}

type(tester)
Out[66]: numpy.ndarray

Question 2

np.isnan ネイティブdtype（np.float64など）のNumPy配列に適用できます。

In [99]: np.isnan(np.array([np.nan, 0], dtype=np.float64))
Out[99]: array([ True, False], dtype=bool)

ただし、オブジェクト配列に適用するとTypeErrorが発生します。

In [96]: np.isnan(np.array([np.nan, 0], dtype=object))
TypeError: ufunc 'isnan' not supported for the input types, and the inputs could not be safely coerced to any supported types according to the casting rule ''safe''

あなたはパンダを持っているので、pd.isnull代わりに使うことができます-オブジェクトのNumPy配列またはネイティブdtypeを受け入れることができます：

In [97]: pd.isnull(np.array([np.nan, 0], dtype=float))
Out[97]: array([ True, False], dtype=bool)

In [98]: pd.isnull(np.array([np.nan, 0], dtype=object))
Out[98]: array([ True, False], dtype=bool)

Noneオブジェクト配列ではnull値とも見なされることに注意してください。

Question 3

np.isnan（）およびpd.isnull（）の優れた代替は

for i in range(0,a.shape[0]):
    if(a[i]!=a[i]):
       //do something here
       //a[i] is nan

ナンだけがそれ自身と等しくないからです。

Question 4

@unutbuの回答に加えて、パンダのnumpyオブジェクト配列をネイティブ（float64）型に強制できます。

import pandas as pd
pd.to_numeric(df['tester'], errors='coerce')

errors = 'coerce'を指定すると、数値に解析できない文字列を強制的にNaNにすることができます。列タイプはdtype: float64であり、isnanチェックは機能するはずです

Question 5

Pandasを使用してcsvファイルをインポートしてください

import pandas as pd

condition = pd.isnull(data[i][j])