Course Outline - CMPT 732 - PMP Lab I
Information
Subject
Catalog Number
Section
Semester
Title
Instructor(s)
Campus
CMPT
732
G100
2022 Fall (1227)
PMP Lab I
Gregory Baker George Chow
Burnaby Mountain Campus
Calendar Objective/Description
PMP Lab I
Instructor's Objectives
Many companies today collect massive amounts of data that cannot be managed without proper programming techniques. This lab courses focuses on the practical aspects of dealing with such data. It will provide insight on MapReduce, Spark, NoSQL databases, cloud computing, and data analytics for large data sets. Instructor's Objectives: ------------------------ The objective of this class is to ensure that students will be able to: - Use a distributed file system such as (or similar to) HDFS (Hadoop Distributed File System). - Write software that can interact with a distributed file system using programming tools that are part of Apache Hadoop. - Write simple distributed software using common tools. - Be able to formulate and implement queries on large data sets. - Write software that can interact with at least one non-relational database. You should have access to a computer powerful enough to run a virtual machine: at least 8 GB memory, 20 GB disk, and a reasonably decent processor. Computers are also available in our computer lab.
Prerequisites
see go.sfu.ca
Topics
- Big data storage and analysis
- Hadoop ecosystem
- MapReduce
- NoSQL database (HBase, Cassandra)
- Cloud computing
- Data analytics
- Spark
- Data Ingestion (Kafka)
Grading
To be discussed in the first week of class. Will include regular lab assignments, and a final project.
Academic Honesty Statement
Academic honesty plays a key role in our efforts to maintain a high standard of academic excellence and integrity. Students are advised that ALL acts of intellectual dishonesty will be handled in accordance with the SFU Academic Honesty and Student Conduct Policies ( http://www.sfu.ca/policies/gazette/student.html ).