Jul 22, 2024  
2019-2020 Course Catalog 
2019-2020 Course Catalog [ARCHIVED CATALOG]

Add to Portfolio (opens a new window)

CVF 1072 - Machine Learning with Big Data Technology

Credits: 4
Hours/Week: Lecture 3 Lab 1
Course Description: This course is for students who want to attain Operational Intelligence level 4, (business insights) and covers implementing analytics and Big Data projects using Splunk’s statistics, machine learning, and built-in and custom visualization capabilities.   The course introduces students to the theory and methods of analytics and statistical modeling by exploring and operationalizing an analytics project using data visualization techniques.  Student will have the opportunity to apply all the concepts learned in this class by using Splunk Big Data technology to solve hands-on scenario-based examples and hands-on challenges throughout the course.  The Data Science Associate (EMCDSA) certification exam is part of this course.
MnTC Goals

Prerequisite(s): CVF 1071  and CVF 1205  or CSCI 1060  with grades of C or higher OR instructor consent.
Corequisite(s): None
Recommendation: None

Major Content
  1. Analytics Framework
  2. Exploratory Data Analysis
  3. Overview of Machine Learning
  4. Market Segmentation
  5. Transactional Analysis
  6. Anomaly Detection
  7. Estimation and Prediction
  8. Classification
  9. Data Visualization
  10. Splunk IT Service Intelligence Advanced Analytics
  11. Splunk User Behavior Analytics
  12. Custom Machine Learning with Splunk
  13. Modeling Use Cases with Splunk

Learning Outcomes
At the end of this course, students will be able to:

  1. define terms related to analytics and data science.
  2. describe the framework for multi-departmental analytics projects.
  3. identify analytics project best practices.
  4. identify common use cases.
  5. define exploratory data analysis.
  6. identify exploratory data analysis use cases.
  7. describe Splunk exploratory data analysis solutions.
  8. define fundamental concepts and terms associated with machine learning.
  9. model data using machine learning.
  10. split data for training and testing models.
  11. define market segmentation.
  12. identify market segmentation use cases.
  13. describe Splunk market segmentation solutions.
  14. define Transactional Analysis.
  15. identify Transactional Analysis use cases.
  16. describe Transactional Analysis solutions.
  17. define anomaly detection.
  18. identify anomaly detection use cases.
  19. describe Splunk anomaly detection solutions.
  20. define estimation and prediction.
  21. identify estimation and prediction use cases.
  22. describe Splunk estimation and prediction
  23. solutions.
  24. describe why classification is key to analytics.
  25. define key classification terms.
  26. evaluate classifier results.
  27. describe why data visualization is key to analytics.
  28. optimize Splunk visualizations for specific data and personas.
  29. enrich machine data with structured business data.
  30. prep data for advanced analytics using Splunk.
  31. leverage packaged machine learning (Splunk) to identify root cause of service disruptions.
  32. deploy machine learning using Splunk IT Service Intelligence.
  33. detect insider threats using Splunk User Behavior Analytics with built-in machine learning .
  34. establish baseline for normal operational patterns and use statistical measurements to determine threshold variability patterns.
  35. dynamically adapt thresholds to changing behavior and highlight anomalous activity.
  36. combine event data with advanced analytics to reduce event clutter, false positives, and extensive rules maintenance.
  37. implement automated detection of insider threats and external attacks.
  38. identify deviations across multiple entities-users, devices, and applications-by comparing them against an entity’s baseline and its dynamically generated peer groups.
  39. provide visualization of statistical aggregates across multiple entities along with enriched kill-chain visualization of a threat vector.
  40. administer model building, validation, and deployment.
  41. provide interactive examples for typical IT, security, business process, and IoT (Internet of Things) use cases
  42. build models using any of the 300+ algorithms accessible through the ML Toolkit.
  43. Use Splunk to automatically detect anomalies and patterns in data to help investigators identify and resolve incidents.
  44. identify normal data patterns in varying levels of detail in order to alert only on abnormal conditions for a specific set of circumstances.
  45. identify patterns of activity to anticipate and react to circumstances that might otherwise disrupt operations or revenues such as proactive maintenance.
  46. apply machine learning analysis to historical data and models to forecast demand, manage inventory, optimize operations, and react to changing conditions.​

Competency 1 (1-6)
Competency 2 (7-10)

Courses and Registration

Add to Portfolio (opens a new window)