IBM
IBM Data Engineering Professional Certificate

New! Discover how 91% of learners achieved at least one positive career outcome. Learn more.

IBM

IBM Data Engineering Professional Certificate

Prepare for a career as a Data Engineer. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

IBM Skills Network Team
Muhammad Yahya
Abhishek Gagneja

Instructors: IBM Skills Network Team

Included with Coursera Plus

Earn a career credential that demonstrates your expertise

(6,272 reviews)

Beginner level

Recommended experience

6 months at 10 hours a week
Flexible schedule
Earn a career credential
Share your expertise with employers
Earn a career credential that demonstrates your expertise

(6,272 reviews)

Beginner level

Recommended experience

6 months at 10 hours a week
Flexible schedule
Earn a career credential
Share your expertise with employers

What you'll learn

  • Master the most up-to-date practical skills and knowledge data engineers use in their daily roles

  • Learn to create, design, & manage relational databases & apply database administration (DBA) concepts to RDBMSs such as MySQL, PostgreSQL, & IBM Db2 

  • Develop working knowledge of NoSQL & Big Data using MongoDB, Cassandra, Cloudant, Hadoop, Apache Spark, Spark SQL, Spark ML, and Spark Streaming 

  • Implement ETL & Data Pipelines with Bash, Airflow & Kafka; architect, populate, deploy Data Warehouses; create BI reports & interactive dashboards

Overview

What’s included

Shareable certificate

Add to your LinkedIn profile

Taught in English
179 practice exercises

Professional Certificate - 16 course series

What you'll learn

  • List basic skills required for an entry-level data engineering role.

  • Discuss various stages and concepts in the data engineering lifecycle.

  • Describe data engineering technologies such as Relational Databases, NoSQL Data Stores, and Big Data Engines.

  • Summarize concepts in data security, governance, and compliance.

Skills you'll gain

Data Pipelines, Extract, Transform, Load, Data Warehousing, Data Architecture, Data Security, Data Store, Relational Databases, Big Data, Data Governance, Apache Spark, SQL, NoSQL, Data Science, Databases, Data Lakes, and Apache Hadoop

What you'll learn

  • Develop a foundational understanding of Python programming by learning basic syntax, data types, expressions, variables, and string operations.

  • Apply Python programming logic using data structures, conditions and branching, loops, functions, exception handling, objects, and classes.

  • Demonstrate proficiency in using Python libraries such as Pandas and Numpy and developing code using Jupyter Notebooks.

  • Access and extract web-based data by working with REST APIs using requests and performing web scraping with BeautifulSoup.

Skills you'll gain

Python Programming, Pandas (Python Package), Web Scraping, NumPy, Data Structures, Jupyter, JSON, Data Manipulation, Object Oriented Programming (OOP), Application Programming Interface (API), Data Import/Export, Scripting, Data Processing, Computer Programming, Restful API, Automation, Programming Principles, and Data Analysis

What you'll learn

  • Demonstrate your skills in Python for working with and manipulating data

  • Implement webscraping and use APIs to extract data with Python

  • Play the role of a Data Engineer working on a real project to extract, transform, and load data

  • Use Jupyter notebooks and IDEs to complete your project

Skills you'll gain

Python Programming, Extract, Transform, Load, Data Manipulation, Web Scraping, Data Processing, Code Review, Restful API, Integrated Development Environments, Databases, SQL, Style Guides, Unit Testing, Data Transformation, and Application Programming Interface (API)

What you'll learn

  • Describe data, databases, relational databases, and cloud databases.

  • Describe information and data models, relational databases, and relational model concepts (including schemas and tables). 

  • Explain an Entity Relationship Diagram and design a relational database for a specific use case.

  • Develop a working knowledge of popular DBMSes including MySQL, PostgreSQL, and IBM DB2

Skills you'll gain

Relational Databases, SQL, Database Design, PostgreSQL, MySQL, Data Manipulation, Database Architecture and Administration, IBM DB2, Data Management, Databases, Database Management Systems, Command-Line Interface, Data Integrity, and Data Modeling

What you'll learn

  • Analyze data within a database using SQL and Python.

  • Create a relational database and work with multiple tables using DDL commands.

  • Construct basic to intermediate level SQL queries using DML commands.

  • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.

Skills you'll gain

SQL, Pandas (Python Package), Jupyter, Data Analysis, Databases, Data Manipulation, Relational Databases, Stored Procedure, Transaction Processing, Query Languages, and Python Programming

What you'll learn

  • Describe the Linux architecture and common Linux distributions and update and install software on a Linux system.

  • Perform common informational, file, content, navigational, compression, and networking commands in Bash shell.

  • Develop shell scripts using Linux commands, environment variables, pipes, and filters.

  • Schedule cron jobs in Linux with crontab and explain the cron syntax. 

Skills you'll gain

Linux Commands, Shell Script, Linux, Unix, File Management, Scripting, Unix Commands, Automation, Operating Systems, Network Protocols, Software Installation, Ubuntu, Command-Line Interface, Bash (Scripting Language), Linux Servers, and Scripting Languages

What you'll learn

  • Create, query, and configure databases and access and build system objects such as tables.

  • Perform basic database management including backing up and restoring databases as well as managing user roles and permissions. 

  • Monitor and optimize important aspects of database performance. 

  • Troubleshoot database issues such as connectivity, login, and configuration and automate functions such as reports, notifications, and alerts. 

Skills you'll gain

Database Management, Database Architecture and Administration, Database Administration, MySQL, Relational Databases, Database Design, Database Systems, Encryption, Disaster Recovery, Role-Based Access Control (RBAC), User Accounts, System Monitoring, Performance Tuning, Operational Databases, Data Storage Technologies, IBM DB2, and PostgreSQL

What you'll learn

  • Describe and contrast Extract, Transform, Load (ETL) processes and Extract, Load, Transform (ELT) processes.

  • Explain batch vs concurrent modes of execution.

  • Implement ETL workflow through bash and Python functions.

  • Describe data pipeline components, processes, tools, and technologies.

Skills you'll gain

Extract, Transform, Load, Data Pipelines, Apache Airflow, Apache Kafka, Shell Script, Data Transformation, Big Data, Performance Tuning, Data Processing, Data Mart, Data Integration, Data Migration, Command-Line Interface, Web Scraping, Scalability, Unix Shell, and Data Warehousing

What you'll learn

  • Job-ready data warehousing skills in just 6 weeks, supported by practical experience and an IBM credential.

  • Design and populate a data warehouse, and model and query data using CUBE, ROLLUP, and materialized views.

  • Identify popular data analytics and business intelligence tools and vendors and create data visualizations using IBM Cognos Analytics.

  • How to design and load data into a data warehouse, write aggregation queries, create materialized query tables, and create an analytics dashboard.

Skills you'll gain

Data Warehousing, Snowflake Schema, Data Lakes, Data Mart, Star Schema, IBM DB2, SQL, Query Languages, Extract, Transform, Load, PostgreSQL, Database Systems, Database Design, Data Cleansing, Data Integration, Data Architecture, Data Quality, Data Validation, and Data Modeling

What you'll learn

  • Explore the purpose of analytics and Business Intelligence (BI) tools

  • Discover the capabilities of IBM Cognos Analytics and Google Looker Studio

  • Showcase your proficiency in analyzing DB2 data with IBM Cognos Analytics

  • Create and share interactive dashboards using IBM Cognos Analytics and Google Looker Studio

Skills you'll gain

IBM Cognos Analytics, Data Visualization Software, Looker (Software), Dashboard, Interactive Data Visualization, Analytics, Data Presentation, Business Intelligence, Data Visualization, and Business Intelligence Software

What you'll learn

  • Differentiate among the four main categories of NoSQL repositories.

  • Describe the characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools.

  • Perform common tasks using MongoDB tasks including create, read, update, and delete (CRUD) operations.

  • Execute keyspace, table, and CRUD operations in Cassandra.

Skills you'll gain

NoSQL, Apache Cassandra, MongoDB, Data Modeling, Distributed Computing, Query Languages, Scalability, JSON, IBM Cloud, Database Management, Databases, Data Manipulation, and Database Architecture and Administration

What you'll learn

  • Explain the impact of big data, including use cases, tools, and processing methods.

  • Describe Apache Hadoop architecture, ecosystem, practices, and user-related applications, including Hive, HDFS, HBase, Spark, and MapReduce.

  • Apply Spark programming basics, including parallel programming basics for DataFrames, data sets, and Spark SQL.

  • Use Spark’s RDDs and data sets, optimize Spark SQL using Catalyst and Tungsten, and use Spark’s development and runtime environment options.

Skills you'll gain

Apache Spark, Big Data, Distributed Computing, Apache Hadoop, Scalability, IBM Cloud, Data Processing, Apache Hive, Debugging, PySpark, Docker (Software), Performance Tuning, Kubernetes, and Data Transformation

What you'll learn

  • Describe ML, explain its role in data engineering, summarize generative AI, discuss Spark's uses, and analyze ML pipelines and model persistence.

  • Evaluate ML models, distinguish between regression, classification, and clustering models, and compare data engineering pipelines with ML pipelines.

  • Construct the data analysis processes using Spark SQL, and perform regression, classification, and clustering using SparkML.

  • Demonstrate connecting to Spark clusters, build ML pipelines, perform feature extraction and transformation, and model persistence.

Skills you'll gain

Machine Learning, Apache Spark, Extract, Transform, Load, Regression Analysis, Supervised Learning, PySpark, Data Pipelines, Unsupervised Learning, Data Transformation, Feature Engineering, Predictive Modeling, Applied Machine Learning, Data Processing, Generative AI, and Apache Hadoop

What you'll learn

  • Demonstrate proficiency in skills required for an entry-level data engineering role.

  • Design and implement various concepts and components in the data engineering lifecycle such as data repositories.

  • Showcase working knowledge with relational databases, NoSQL data stores, big data engines, data warehouses, and data pipelines.

  • Apply skills in Linux shell scripting, SQL, and Python programming languages to Data Engineering problems.

Skills you'll gain

Data Warehousing, SQL, Extract, Transform, Load, MySQL, Apache Spark, Data Pipelines, Dashboard, Big Data, Data Analysis, MongoDB, NoSQL, Data Infrastructure, PostgreSQL, Relational Databases, Python Programming, Data Architecture, IBM DB2, IBM Cognos Analytics, Applied Machine Learning, and Databases

What you'll learn

  • Leverage various generative AI tools and techniques in data engineering processes across industries

  • Implement various data engineering processes such as data generation, augmentation, and anonymization using generative AI tools

  • Practice generative AI skills in hands-on labs and projects for data warehouse schema design and infrastructure setup

  • Evaluate real-world case studies showcasing the successful application of Generative AI for ETL and data repositories

Skills you'll gain

Generative AI, Data Synthesis, Data Analysis, Extract, Transform, Load, Data Warehousing, Data Infrastructure, Query Languages, Data Ethics, Data Architecture, Data Quality, Data Mining, Data Pipelines, Database Design, Responsible AI, and Artificial Intelligence

What you'll learn

  • Describe the role of a data engineer and some career path options as well as the prospective opportunities in the field.

  • Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.

  • Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.

  • Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.

Skills you'll gain

Interviewing Skills, Professional Networking, Data Pipelines, Professional Development, Data Infrastructure, Communication Strategies, LinkedIn, Technical Communication, Data Strategy, Verbal Communication Skills, and Data Ethics

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

When you complete this Professional Certificate, you may be able to have your learning recognized for credit if you are admitted and enroll in one of the following online degree programs.¹

 
ACE Logo

This Professional Certificate has ACE® recommendation. It is eligible for college credit at participating U.S. colleges and universities. Note: The decision to accept specific credit recommendations is up to each institution. 

Instructors

IBM Skills Network Team
84 Courses1,586,121 learners
Muhammad Yahya
IBM
5 Courses94,221 learners
Abhishek Gagneja
IBM
6 Courses244,274 learners
Shubhra Das
7 Courses51,347 learners
Romeo Kienzler
IBM
10 Courses795,789 learners
Joseph Santarcangelo
IBM
36 Courses2,205,288 learners
Rav Ahuja
IBM
56 Courses4,409,729 learners
Hima Vasudevan
IBM
4 Courses635,444 learners
Sandip Saha Joy
IBM
5 Courses654,788 learners
Priya Kapoor
IBM
1 Course229,731 learners
Steve Ryan
IBM
12 Courses731,261 learners
Lavanya Thiruvali Sunderarajan
8 Courses230,303 learners
Aije Egwaikhide
IBM
6 Courses756,464 learners
Yan Luo
IBM
7 Courses380,831 learners
Ramesh Sannareddy
IBM
15 Courses454,355 learners

Offered by

IBM

Compare with similar products

Rating
Level
Skills
Tools
Last updated
Number of practice exercises
Degree eligibility
Part of Coursera Plus

You might also like

Why people choose Coursera for their career

Coursera Plus

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions

¹ Median salary and job opening data are sourced from Lightcast™ Job Postings Report. Content Creator, Machine Learning Engineer and Salesforce Development Representative (1/1/2024 - 12/31/2024) All other job roles (11/1/2024 - 11/1/2025)