Difference between revisions of "Pandas notes"

From Simson Garfinkel
Jump to navigationJump to search
(Created page with "==Memory Ideas== print the data frame types: df.dtypes print if the data frame columns are dense are sparse: df.ftypes Other ideas: df.info() df.info(memory_...")
 
m
Line 17: Line 17:
     surveys_df['record_id'].dtype
     surveys_df['record_id'].dtype


Missing values:
    any missing values = df.isnull().values.any()
    total missing values = df.isnull().sum()


References:
References:

Revision as of 15:05, 26 June 2018

Memory Ideas

print the data frame types:

   df.dtypes

print if the data frame columns are dense are sparse:

   df.ftypes

Other ideas:

   df.info()
   df.info(memory_usage='deep')
   df.memory_usage(deep=True)
   sys.getsizeof(df)
   

Convert the record_id field from an integer to a float

   surveys_df['record_id'] = surveys_df['record_id'].astype('float64')
   surveys_df['record_id'].dtype

Missing values:

   any missing values = df.isnull().values.any()
   total missing values = df.isnull().sum()

References: