File Formats Any Data Scientist Cares About
Piotr Wendykier and Sean Cheren
Efficient data import and representation is a key to any data science project dealing with large datasets. This talk will discuss import and export of Parquet, Arrow IPC and CSV files. These formats are part of the project aiming to add columnar data structures to Wolfram Language. A performance comparison with Python libraries will be presented.
Thanks for your feedback.