Course

Data Engineering: Level 01

Data engineering is the aspect of data science that focuses on practical applications of data collection and analysis. For all the work that data scientists do to answer questions using large sets of information, there must be mechanisms for collecting and validating that information. In order for that work to ultimately have any value, there also have to be mechanisms for applying it to real-world operations in some way. Those are both engineering tasks: the application of science to practical, functioning systems. Data engineers focus on the applications and harvesting of big data. Their role doesn’t include a great deal of analysis or experimental design. Instead, they are out where the rubber meets the road (literally, in the case of self-driving vehicles), creating interfaces and mechanisms for the flow and access of information.

6 Lessons
Outcomes

By the end of the course, learners will be able to:

  • Basic terminology pertinent to Data Engineering
  • Components of Data Engineering pipeline
  • Familiarization with Big Data concepts and platforms like Hadoop, Spark, Sqoop etc.
  • Creation of a backend pipeline using Open Source software

 

Level: 01
Duration: 25 Hours
Pre-requisites: NA
What’s next: Data Governance: Level 0