Read data

Read Parquet file in Python

Parquet files are designed to store large volumes of data in columnar storage format. Parquet files can be read in Python code using pandas and pyarrow packages.

Please check pandas.read_parquet function for more details.

parquetpandas

Required packages

You need below packages to use the code generated by recipe. All packages are automatically installed in MLJAR Studio.

pandas>=1.0.0

pyarrow>=16.1.0

Interactive recipe

You can use below interactive recipe to generate code. This recipe is available in MLJAR Studio.

Python code

# Python code will be here

Code explanation

  1. Read Parquet file from provided path.
  2. Display shape of loaded data.
  3. Display first rows of data.

Additionally, you can specify columns list as argument in read_parquet() function, and only columns from list will be loaded.

« Previous
Read HTML
Next »
Read SPSS