• In general when we deal with data pandas libray is used very commonly due to some important functions in data science

    • data analysis
    • data cleaning
    • data exploration
    • data manipulation
  • Pandas – Panel Data and python data analysis it’s a multidimensional data involving measurements over time

  • Pandas alone cannot perform. It is built on numPy, as it can also handle ndimensional array. So both libraries required

  • Features – series obj & data frame,aligns data, slicing, indexing, subseting, handles missing data, groups by functionality

  • Features – merging & joining, labeling of axes hierarchially, time-series functionality, reshaping & robust input/output tool

  • Pandas – great for > 500k rows, works great for tabular data, arbitrary matrix & time series matrix

  • Numpy – < 500k rows, however memory efficinet