Understand options for loading data using a data load wizard.
AKIAIOSFODNN7EXAMPLE
).wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
).s3://firebolt-publishing-public/help_center_assets/firebolt_sample_dataset/
.levels.csv
.0%
, meaning any error will stop the loading process. If you wish to continue loading despite errors, set Max errors per file to a non-zero value. For example, entering 10%
or 10
allows the process to continue until errors affect 10%
of the rows.
ON
if the columns in your data can contain NULL
values. If this value is toggled off for a column, and that column contains NULL
values, then the wizard will generate an error and stop loading.
ON
for the columns you want to include in your primary index.
WHERE
, JOIN
, GROUP_BY
, and other clauses used for sorting. Selecting the best primary index, which is a sparse index, can reduce query run times significantly by reducing the data set that the query scans. A primary index also allows Firebolt to manage updates, deletions and insertions to tables and provide optimal query performance.column_1
first, then select column_3
, then column_3
will be added as a primary index after column_1
. This means column_1
will be used first as a sparse index, followed by column_3
. If you choose more than one primary index, the order of sorting appears next to the toggle switch under the Primary Index column. In the previous example, the number 1
appears next to column_1
and a number 2
appears next to column_3
. To achieve optimal results, choose indexes in the ascending order of their cardinality, or the number of unique values. Start with the column that has the lowest number of unique values as your first primary index, followed by the column with the next cardinality. For more information about how to choose a primary index, see Primary index.