Through instructor-led discussion and interactive, hands-on exercises, participants will navigate the Hadoop
ecosystem and develop concrete skills, including:
- How to identify potential business use cases where data science can provide impactful results.
- How to obtain, clean and combine disparate data sources to create a coherent picture for analysis.
- Which statistical methods to leverage for data exploration to gain critical insight into your data.
- Where and when to leverage Hadoop Streaming and Apache Flume for data science pipelines.
- What machine learning technique to use for a particular data science project.