For IndividualsFor BusinessesFor UniversitiesFor Governments
Coursera
  • All DegreesExplore Bachelor’s & Master’s degrees
  • Bachelor’s DegreesExplore master’s degrees from leading universities
  • Master’s DegreesExplore Computer Science & Engineering degrees
  • Postgraduate StudiesDeepen your expertise with postgraduate learning
  • MasterTrack™Earn credit towards a Master’s degree
  • University CertificatesAdvance your career with graduate-level learning
Find your New Career
  • Browse
  • Top Courses
  • Log In
  • Join for Free
    Coursera
    • Browse
    • Apache Spark

    Filter by

    80 results for "apache spark"

    • Placeholder
      IBM Skills Network

      IBM Data Engineering

      Skills you'll gain: Data Management, Databases, Data Architecture, Data Structures, Big Data, Database Theory, SQL, Apache, Database Administration, Extract, Transform, Load, Python Programming, Data Model, Database Application, Data Warehousing, Data Analysis, NoSQL, Data Engineering, Distributed Computing Architecture, Database Design, Operating Systems, System Programming, System Software, Programming Principles, Statistical Programming, Algebra, Computer Architecture, PostgreSQL, Applied Machine Learning, Correlation And Dependence, Feature Engineering, General Statistics, Graph Theory, Machine Learning, Machine Learning Algorithms, Machine Learning Software, Regression, Statistical Analysis, Statistical Machine Learning, Data Visualization, Data Visualization Software, Basic Descriptive Statistics, Exploratory Data Analysis, Cloud Applications, Cloud Computing, Data Science, DevOps, Kubernetes, Leadership and Management, Network Architecture, Network Security, Other Programming Languages, Professional Development, Security Engineering, Algorithms, Computational Logic, Computational Thinking, Computer Networking, Computer Programming, Computer Programming Tools, IBM Cloud, Linux, Mathematical Theory & Analysis, Mathematics, Microarchitecture, Project Management, Security Strategy, Software Architecture, Software Engineering, Strategy and Operations, Theoretical Computer Science

      4.6

      (41.3k reviews)

      Beginner · Professional Certificate · 3-6 Months

    • Placeholder
      Google Cloud

      BigQuery Fundamentals for Redshift Professionals

      Intermediate · Course · 1-3 Months

    • Placeholder
      Databricks

      Data Science with Databricks for Data Analysts

      Skills you'll gain: Data Management, Apache, Algorithms, Computer Programming, Machine Learning, Probability & Statistics, Theoretical Computer Science, Data Analysis, Mathematics, Big Data, Databases, SQL, Data Science, Statistical Programming, Exploratory Data Analysis, Machine Learning Algorithms, Feature Engineering, Applied Machine Learning, General Statistics, Basic Descriptive Statistics, Extract, Transform, Load, Data Structures, Dimensionality Reduction, Business Analysis, Statistical Analysis

      4.5

      (470 reviews)

      Intermediate · Specialization · 3-6 Months

    • Placeholder
      Microsoft

      Microsoft Azure Data Engineering Associate (DP-203)

      Skills you'll gain: Cloud Computing, Microsoft Azure, Data Management, Big Data, Extract, Transform, Load, Data Warehousing, Cloud Storage, Databases, Data Analysis, Computer Networking, Statistical Programming, Apache, Accounting, Business Analysis, Cloud Infrastructure, Computer Architecture, Financial Analysis, Network Architecture, SQL, Computer Programming, Continuous Delivery, Continuous Integration, DevOps, NoSQL, Business Psychology, Entrepreneurship, Leadership and Management, Organizational Development, Application Development, Data Architecture, Network Security, Security Engineering, Security Strategy, Software Engineering, Software Engineering Tools

      4.3

      (544 reviews)

      Intermediate · Professional Certificate · 3-6 Months

    • Placeholder
      IBM Skills Network

      Introduction to Big Data with Spark and Hadoop

      Skills you'll gain: Apache, Big Data, Data Architecture, Distributed Computing Architecture, Computer Architecture, Data Management, Cloud Applications, Cloud Computing, Data Analysis, Data Warehousing, Database Administration, Databases, DevOps, Extract, Transform, Load, Kubernetes, Network Architecture, Other Programming Languages, SQL

      4.3

      (186 reviews)

      Beginner · Course · 1-3 Months

    • Placeholder
      École Polytechnique Fédérale de Lausanne

      Big Data Analysis with Scala and Spark

      Skills you'll gain: Apache, Big Data, Computer Programming, Data Management, Data Engineering, Other Programming Languages, Data Analysis, Data Analysis Software, SQL, Scala Programming

      4.6

      (2.6k reviews)

      Intermediate · Course · 1-4 Weeks

    • Placeholder
      Placeholder
      University of California, Davis

      Distributed Computing with Spark SQL

      Skills you'll gain: Data Management, Apache, Big Data, Databases, SQL, Statistical Programming, Data Warehousing, Machine Learning, Data Science

      4.5

      (574 reviews)

      Intermediate · Course · 1-4 Weeks

    • Placeholder
      Placeholder
      Databricks

      Apache Spark (TM) SQL for Data Analysts

      Skills you'll gain: Apache, Data Management, Data Analysis, Exploratory Data Analysis, Big Data, Basic Descriptive Statistics, Databases, Extract, Transform, Load, SQL, Business Analysis, Probability & Statistics, Statistical Analysis, Statistical Programming

      4.6

      (423 reviews)

      Intermediate · Course · 1-3 Months

    • Placeholder
      Placeholder
      Coursera Project Network

      Use the Apache Spark Structured Streaming API with MongoDB

      Intermediate · Guided Project · Less Than 2 Hours

    • Placeholder
      Placeholder
      École Polytechnique Fédérale de Lausanne

      Functional Programming Principles in Scala

      Skills you'll gain: Computer Programming, Other Programming Languages, Scala Programming, Algorithms, Computational Logic, Computer Programming Tools, Data Management, Data Structures, Mathematical Theory & Analysis, Mathematics, Programming Principles, Theoretical Computer Science

      4.8

      (8.2k reviews)

      Intermediate · Course · 1-3 Months

    • Placeholder
      Placeholder
      IBM Skills Network

      Introduction to Data Engineering

      Skills you'll gain: Data Engineering, Data Management, Extract, Transform, Load, Databases, Apache, Big Data, Data Analysis, Data Architecture, Data Warehousing, Leadership and Management, Network Security, Professional Development, SQL, Security Engineering, Computer Architecture, Computer Networking, Data Science, Database Administration, Distributed Computing Architecture, NoSQL, Project Management, Security Strategy, Statistical Programming, Strategy and Operations

      4.7

      (1.6k reviews)

      Beginner · Course · 1-4 Weeks

    • Placeholder
      Placeholder
      Google Cloud

      BigQuery Fundamentals for Snowflake Professionals

      Intermediate · Course · 1-3 Months

    Searches related to apache spark

    apache spark (tm) sql for data analysts
    use the apache spark structured streaming api with mongodb
    data engineering with ms azure synapse apache spark pools
    scalable machine learning on big data using apache spark
    1234…7

    In summary, here are 10 of our most popular apache spark courses

    • IBM Data Engineering: IBM Skills Network
    • BigQuery Fundamentals for Redshift Professionals: Google Cloud
    • Data Science with Databricks for Data Analysts: Databricks
    • Microsoft Azure Data Engineering Associate (DP-203): Microsoft
    • Introduction to Big Data with Spark and Hadoop: IBM Skills Network
    • Big Data Analysis with Scala and Spark: École Polytechnique Fédérale de Lausanne
    • Distributed Computing with Spark SQL: University of California, Davis
    • Apache Spark (TM) SQL for Data Analysts: Databricks
    • Use the Apache Spark Structured Streaming API with MongoDB: Coursera Project Network
    • Functional Programming Principles in Scala: École Polytechnique Fédérale de Lausanne

    Skills you can learn in Machine Learning

    Python Programming (33)
    Tensorflow (32)
    Deep Learning (30)
    Artificial Neural Network (24)
    Big Data (18)
    Statistical Classification (17)
    Reinforcement Learning (13)
    Algebra (10)
    Bayesian (10)
    Linear Algebra (10)
    Linear Regression (9)
    Numpy (9)

    Frequently Asked Questions about Apache Spark

    • Apache Spark is an open source analytics framework for large-scale data processing with capabilities for streaming, SQL, machine learning, and graph processing. Apache Spark is important to learn because its ease of use and extreme processing speeds enable efficient and scalable real-time data analysis.

      Apache Spark can process in-memory on dedicated clusters to achieve speeds 10-100 times faster than the disc-based batch processing Apache Hadoop with MapReduce can provide, making it a top choice for anyone processing big data. Spark is also easy to use, with the ability to write applications in its native Scala, or in Python, Java, R, or SQL. This versatility and accessibility helps startups harness the powerful data science they need for cutting edge innovation.

      Spark also provides the scalable machine learning needed by artificial intelligence (AI) engineers to create applications that can transform the way we interact with digital technology, from recommendation algorithms on services like Netflix and Spotify to automated medical screening.‎

    • Many careers in data science benefit from skills in Apache Spark, as software development engineers, data scientists, data analysts, and machine learning engineers use Spark on a daily basis. These roles are in high demand and are thus highly compensated; according to Glassdoor, machine learning engineers earn an average salary of $114,121 per year.

      Machine learning engineers design and build self-learning software and monitor its iterations to fine tune how models perform when they are scaled up and put into service. These professionals need a background in both software engineering and data science, and are increasingly being hired in a wide variety of fields such as education, healthcare, and finance. As machine learning continues to expand into many more fields, the need for machine learning engineers will continue to grow.‎

    • Yes! Coursera offers a wide range of popular online courses and Specializations on data science in general and Apache Spark specifically, including courses in related topics like scalable machine learning, distributed computing, and big data analysis. You’ll learn from top-ranked institutions and organizations like the University of California Davis, the University of California San Diego, École Polytechnique Fédérale de Lausanne, and IBM, so you don’t have to sacrifice the quality of your education for the flexibility of learning remotely.

      Coursera also offers the courses needed to work towards the IBM AI Engineering Professional Certificate. And, if you want to take your data science education to the next level, Coursera provides you with the opportunity to pursue a Master of Science in Data Science through the University of Colorado.‎

    • Because Spark works in application programming interfaces like Scala, Java, and Python, it helps to have a good grasp of one or more of these programming languages. Other prerequisites may vary depending on the level of the course you're taking. While beginner-level courses allow you to become familiar with Apache Spark and develop skills as you go, intermediate or advanced courses may require additional skills or experience within data science or computer programming. As you progress with learning Apache Spark, you'll develop the skills needed to read and write data to a variety of sources, parse different types of data, work within the artificial intelligence and machine learning arena, and transform data to leverage insights from it.‎

    • People with a passion for data science and a desire to gain increased access to big data are well suited to learning Apache Spark. This tool opens a variety of opportunities for users to explore big data and leverage it to solve key problems within organizations. Additionally, Spark offers a faster pace for machine learning workloads, with large scale data processing capability that's exponentially faster than other tools like Hadoop. Because Apache Spark is on the front lines of innovation within AI and big data, those with an innate sense of curiosity and a desire to innovate are among those best suited to learning Spark and working in relevant roles.‎

    • If you want to work within big data, learning Apache Spark could be a good move for you. This unified analytics engine is particularly popular because of its speed, the libraries that come with it, robust APIs, and its support for multiple programming languages. Additionally, it could be a smart career move depending on your aspirations. Demand continues to surge for professionals who can leverage Spark's power. In February 2021, Indeed.com listed more than 1,800 open positions looking for full-time Apache Spark professionals across multiple industries. Additionally, according to Databricks, learning Apache Sparks could give you a boost in your earning potential.‎

    This FAQ content has been made available for informational purposes only. Learners are advised to conduct additional research to ensure that courses and other credentials pursued meet their personal, professional, and financial goals.
    Other topics to explore
    Placeholder
    Arts and Humanities
    338 courses
    Placeholder
    Business
    1095 courses
    Placeholder
    Computer Science
    668 courses
    Placeholder
    Data Science
    425 courses
    Placeholder
    Information Technology
    145 courses
    Placeholder
    Health
    471 courses
    Placeholder
    Math and Logic
    70 courses
    Placeholder
    Personal Development
    137 courses
    Placeholder
    Physical Science and Engineering
    413 courses
    Placeholder
    Social Sciences
    401 courses
    Placeholder
    Language Learning
    150 courses

    Coursera Footer

    Learn Something New

    • Learn a Language
    • Learn Accounting
    • Learn Coding
    • Learn Copywriting
    • Learn HR
    • Learn Public Relations
    • Boulder MS Data Science
    • Illinois iMBA
    • Illinois MS Computer Science
    • UMich MS in Applied Data Science

    Popular Data Science Topics

    • Artificial Intelligence
    • Data Analysis
    • Data Engineering
    • Data Science
    • Excel
    • Machine Learning
    • Python
    • Power BI
    • R Programming
    • SQL

    Popular Computer Science & IT Topics

    • Blockchain
    • Coding
    • Computer Science
    • Cybersecurity
    • Full Stack Web Development
    • IT
    • Java
    • Software Engineering
    • Web Design
    • Web Development

    Popular Business Topics

    • Accounting
    • Business Finance
    • Communication Skills
    • Leadership & Management
    • Marketing
    • Product Management
    • Project Management
    • UX Design
    • UX Research
    • Writing

    Coursera

    • About
    • What We Offer
    • Leadership
    • Careers
    • Catalog
    • Coursera Plus
    • Professional Certificates
    • MasterTrack® Certificates
    • Degrees
    • For Enterprise
    • For Government
    • For Campus
    • Become a Partner
    • Coronavirus Response
    • Free Courses
    • All Courses

    Community

    • Learners
    • Partners
    • Beta Testers
    • Translators
    • Blog
    • Tech Blog
    • Teaching Center

    More

    • Press
    • Investors
    • Terms
    • Privacy
    • Help
    • Accessibility
    • Contact
    • Articles
    • Directory
    • Affiliates
    • Modern Slavery Statement
    Learn Anywhere
    Placeholder
    Placeholder
    Placeholder
    © 2023 Coursera Inc. All rights reserved.
    • Placeholder
    • Placeholder
    • Placeholder
    • Placeholder
    • Placeholder
    • Placeholder