Generally, upon installation, pandas is installed as a Python module in a standard location for third-party Python modules. In the following table, you will see the standard installation location for Unix/ macOS and the Windows platform:
Platform |
Standard installation location |
Example |
Unix/macOS |
prefix/lib/pythonX.Y/site-packages |
/usr/local/lib/python2.7/site-packages |
Windows |
prefixLibsite-packages |
C:Python27Libsite-packages |
If Python installation was done with Anaconda, then the pandas module can be found in the Anaconda directory, within a similar file path: Anaconda3pkgspandas-0.23.4-py37h830ac7b_0Libsite-packagespandas.
Now that we have had a look at the module on third-party Python modules, we will understand the file hierarchy. There are eight types of file in the installed Pandas library. The installed files follow a specific hierarchy, which is described here:
- pandas/core: This contains files for fundamental data structures, such as Series/DataFrames and related functionalities.
- pandas/src: This contains Cython and C code for implementing fundamental algorithms.
- pandas/io: This contains input/output tools for handling different file formats, such as flat files, Excel, HDF5, and SQL.
- pandas/tools: This contains auxiliary data algorithms, merge and join routines, concatenation, pivot tables, and more. This module primarily serves data-manipulation operations.
- pandas/sparse: This contains sparse versions of data structures, such as series, DataFrame, Panels, and more.
- pandas/stats: This contains linear and panel regression, moving window regression, and several other statistical functions. This should be replaced by functionality in statsmodels.
- pandas/util: This contains utilities and development and testing tools.
- pandas/rpy: This contains the RPy2 interface for connecting to R, thereby widening the scope of analytical operations on data.
For more information, see: http://pandas.pydata.org/developers.html.