Skip to contents

This function allows to convert a table from a duckdb file to parquet format.
The following extensions are supported : "duckdb" or "db".

Two conversions possibilities are offered :

  • Convert to a single parquet file. Argument `path_to_parquet` must then be used;

  • Convert to a partitioned parquet file. Additionnal arguments `partition` and `partitioning` must then be used;

Usage

duckdb_to_parquet(
  path_to_duckdb,
  table_in_duckdb,
  path_to_parquet,
  partition = "no",
  ...
)

Arguments

path_to_duckdb

string that indicates the path to the duckdb file

table_in_duckdb

string that indicates the name of the table to convert in the duckdb file

path_to_parquet

string that indicates the path to the directory where the parquet file will be stored

partition

string ("yes" or "no" - by default) that indicates whether you want to create a partitioned parquet file. If "yes", `"partitioning"` argument must be filled in. In this case, a folder will be created for each modality of the variable filled in `"partitioning"`.

...

additional format-specific arguments, see arrow::write_parquet() and arrow::write_dataset() for more informations.

Value

A parquet file, invisibly

Examples


# Conversion from a local duckdb file to a single parquet file :

duckdb_to_parquet(
  path_to_duckdb = system.file("extdata","iris.duckdb",package = "parquetize"),
  table_in_duckdb = "iris",
  path_to_parquet = tempdir()
)
#> Reading data...
#> Writing data...
#>  The iris table from your duckdb file is available in parquet format under /tmp/Rtmp9Zjdsp
#> Writing data...


# Conversion from a local duckdb file to a partitioned parquet file  :

duckdb_to_parquet(
  path_to_duckdb = system.file("extdata","iris.duckdb",package = "parquetize"),
  table_in_duckdb = "iris",
  path_to_parquet = tempdir(),
  partition = "yes",
  partitioning =  c("Species")
)
#> Reading data...
#> Writing data...
#>  The iris table from your duckdb file is available in parquet format under /tmp/Rtmp9Zjdsp
#> Writing data...