Read data
Read Parquet file in Python
Parquet files are designed to store large volumes of data in columnar storage format. Parquet files can be read in Python code using pandas and pyarrow packages.
Please check pandas.read_parquet function for more details.
parquetpandas
Required packages
You need below packages to use the code generated by recipe. All packages are automatically installed in MLJAR Studio.
pandas>=1.0.0
pyarrow>=16.1.0
Interactive recipe
You can use below interactive recipe to generate code. This recipe is available in MLJAR Studio.
Python code
# Python code will be here
Code explanation
- Read Parquet file from provided path.
- Display shape of loaded data.
- Display first rows of data.
Additionally, you can specify columns
list as argument in read_parquet()
function, and only columns from list will be loaded.
Read data cookbook
Code recipes from Read data cookbook.