Fundamentals of Scalable Data Science

Issued by Coursera

Authorized by IBM

This badge earner has proven a deep understanding of massive parallel data processing on ApacheSpark. They have mastered low-level functional programming using python on the Resilient Distributed Dataset (RDD) API and mastered relational data processing using Apache SparkSQL & the DataFrame API. Earners understand how data processing & machine learning can be parallelized using scale-out clusters, & can compute statistical measures, integrate & transform data, & create advanced visualizations.

Type Validation
Level Intermediate
Time Days
Cost Paid

Additional Details

Skills

Apache Spark
Data Engineer
Data Science
Descriptive Statistics
Functional Programming
Internet Of Things
PWID-B0569900

Earning Criteria

Complete the Coursera course "Fundamentals of Scalable Data Science" including all hands-on labs and assignments.
Pass the Coursera course assessment criteria.