Computer Science
CS 6240: Large-Scale Parallel Data Processing
Lecture - 4 credits
- Covers big-data analysis techniques that scale out with an increasing number of compute nodes, e.g., for cloud computing.
- Emphasizes approaches to problem and data partitioning that distribute work effectively while keeping the total cost of computation and data transfer low.
- Studies and analyzes deterministic and randomized algorithms from a variety of domains, including graphs, data mining, linear algebra, and information retrieval, in terms of their cost, scalability, and robustness against skew.
- Course work emphasizes hands-on programming experience with modern state-of-the-art big-data processing technology.
- Students who do not meet the course prerequisites may seek permission of the instructor.