Pandas
Use Pandas to analyze data in Firebolt.
Pandas is a powerful open-source data analysis and manipulation library for Python. It provides data structures like DataFrames and Series, which make it easy to work with structured data. Pandas is widely used in data science, machine learning, and statistical analysis due to its flexibility and ease of use.
This guide will show you how to connect Pandas to Firebolt, allowing you to perform data analysis and manipulation on your Firebolt data.
Prerequisites
Before you begin, ensure you have the following prerequisites:
- Python: You need to have Python installed on your machine. You can download it from python.org.
- Firebolt account: You need an active Firebolt account. If you do not have one, you can sign up for free.
- Firebolt Database and Table: Make sure you have a Firebolt database and table with data ready for querying.
- Firebolt Service Account: Create a service account in Firebolt and note its id and secret.
Connecting Pandas to Firebolt
-
Install the required libraries:
-
Connect to Firebolt using a SQLAlchemy engine:
client_id
: client ID of your service account.client_secret
: client secret of your service account.account_name
: The name of your Firebolt account.database
: The name of the database to connect to.engine
: The name of the engine to run SQL queries on.
-
Load data into Pandas using a SQLAlchemy engine:
Done! You can now use Pandas to analyze and manipulate data from Firebolt. You can perform various operations like filtering, aggregating, and visualizing data using Pandas’ powerful features.
Further reading
- Learn more about Pandas and its capabilities.
- Explore the Firebolt SQLAlchemy documentation for more details on using Firebolt with SQLAlchemy.