Data Science with Andreas Lauschke, Session#11: Managing and Analyzing Large Data
Andreas Lauschke
Andreas Lauschke, a senior mathematical programmer, live-demos key Wolfram Language features that are very useful in data science. In this eleventh session, he explores the handling of large data, that means data that is too large to fit into kernel memory. Andreas' dataset is the Amazon customer reviews dataset which is about 70 GB in size. He demonstrates some data preprocessing strategies for targeted data extraction, shows several smart queries against that data, and provides some visualizations.
Thanks for your feedback.