Course Outline
Introduction
- Apache Arrow vs Parquet
Installing and Configuring Apache Arrow
Overview of Apache Arrow Features and Architecture
Exploring Data with Pandas and Apache Arrow
Exploring Data with Spark and Apache Arrow
Exploring Data with R and Apache Arrow
Exploring Data with MapD and Apache Arrow
Other Data Analysis Integrations
- PySpark, Parquet files on S3, and Oracle tables and Elasticsearch indices
Troubleshooting
Summary and Conclusion
Requirements
- A basic undersanding of SQL
- Familiarity with Python or R
- Some familiarity with Apache Spark
Testimonials (7)
The trainer adapted the materials and contents to what he thought would be best for us and he succeeded. The quality of the training was excellent.
Jorge Sanchez Hernandez - CSMART - Carnival
Course - QGIS for Geographic Information System
The trainers flexibility, showing us everything we needed for our work, but also teaching the basics and giving some very good tips. Saadoon and Mateen were great!
Ana Vicente - CSMART - Carnival
Course - QGIS for Geographic Information System
A lot of patience
Mateusz - WestWind Energy Polska Sp. z o.o.
Machine Translated
Professional and very practical, usuefull in a daily work
Jozefin Rékasi - SC Automobile Dacia SA
Course - Advanced Data Analysis with TIBCO Spotfire
I liked Pablo's style, the fact that he covered a lot of subjects from report design , customization with html to implementing simple ML algortithms. Good balance theoretical information / exercices. Pablo really covered all topics i was interested in and gave comprehensive answers to my questions.
Cristian Tudose - SC Automobile Dacia SA
Course - Advanced Data Analysis with TIBCO Spotfire
It covered the areas i said i was interested in before the course: data relationships, using python script. Connecting to databases will be covered in the advanced module.
Cristian Tudose - SC Automobile Dacia SA
Course - Introduction to Spotfire
Good teaching skils, good knowledge of the subjekts