Computing Science Course Outlines

Course Outline - CMPT 732 - PMP Lab I

Information

Subject

Catalog Number

Section

Semester

Title

Instructor(s)

Campus

CMPT

732

G100

2022 Fall (1227)

PMP Lab I

Gregory Baker    George Chow   

Burnaby Mountain Campus

Calendar Objective/Description

PMP Lab I

Instructor's Objectives

Many companies today collect massive amounts of data that cannot be managed without proper programming techniques. This lab courses focuses on the practical aspects of dealing with such data. It will provide insight on MapReduce, Spark, NoSQL databases, cloud computing, and data analytics for large data sets. Instructor's Objectives: ------------------------ The objective of this class is to ensure that students will be able to: - Use a distributed file system such as (or similar to) HDFS (Hadoop Distributed File System). - Write software that can interact with a distributed file system using programming tools that are part of Apache Hadoop. - Write simple distributed software using common tools. - Be able to formulate and implement queries on large data sets. - Write software that can interact with at least one non-relational database. You should have access to a computer powerful enough to run a virtual machine: at least 8 GB memory, 20 GB disk, and a reasonably decent processor. Computers are also available in our computer lab.

Prerequisites

see go.sfu.ca

Topics

  • Big data storage and analysis
  • Hadoop ecosystem
  • MapReduce
  • NoSQL database (HBase, Cassandra)
  • Cloud computing
  • Data analytics
  • Spark
  • Data Ingestion (Kafka)

Grading

To be discussed in the first week of class. Will include regular lab assignments, and a final project.

Academic Honesty Statement

Academic honesty plays a key role in our efforts to maintain a high standard of academic excellence and integrity. Students are advised that ALL acts of intellectual dishonesty will be handled in accordance with the SFU Academic Honesty and Student Conduct Policies ( http://www.sfu.ca/policies/gazette/student.html ).

Data Last Updated: