Room: Ballroom A+B
This talk will describe the activities of a data scientist: data ingestion (ETL), preprocessing, data understanding and predictive analytics. PostgreSQL will be presented as the tool that supports many of those tasks, by itself or using extensions. Data ingestion: Foreign Data Wrappers for accessing other databases, NoSQL data stores and Hadoop; procedural languages for access to other systems Preprocessing: Standard and advanced SQL Data understanding: descriptive statistics with SQL, visualization with PL/R Predictive analytics: machine learning with PL/R and PL/Python, model application in the database The presenter is a data scientist with many years of analytics experience. He's been a fan of PostgreSQL since the 1990s and uses it wherever he can.