CU3: Scalable Data Processing and Artificial Intelligence
Course Summary
This course provides an in-depth understanding of techniques and tools essential for large-scale data mining, emphasizing distributed data processing and modern data analysis technologies. The primary programming language used will be Python, complemented with some references to Java. Students will explore foundational principles of distributed computing, scalable data analysis, real-time data streaming, machine learning in distributed environments, and the integration of NoSQL databases in data mining tasks.
Course Highlights
- Fundamentals of Distributed Data Processing
- Large-Scale Data Mining Techniques
- Real-Time Stream Data Processing
- Machine Learning in Distributed Systems
- NoSQL Databases for Data Mining Integration
- Cloud-based Data Computing (MapReduce)
- Practical Projects and Experimental Evaluations
Instructors
Learn More
For detailed syllabus information, assessment methods, learning outcomes, and bibliographic references, please refer to the official course documentation.