Course Outline

  1. Data preprocessing

    1. Data Cleaning
    2. Data integration and transformation
    3. Data reduction
    4. Discretization and concept hierarchy generation
  2. Statistical inference

    1. Probability distributions, Random variables, Central limit theorem
    2. Sampling
    3. Confidence intervals
    4. Statistical Inference
    5. Hypothesis testing
  3. Multivariate linear regression

    1. Specification
    2. Subset selection
    3. Estimation
    4. Validation
    5. Prediction
  4. Classification methods

    1. Logistic regression
    2. Linear discriminant analysis
    3. K-nearest neighbours
    4. Naive Bayes
    5. Comparison of Classification methods
  5. Neural Networks

    1. Fitting neural networks
    2. Training neural networks issues
  6. Decision trees

    1. Regression trees
    2. Classification trees
    3. Trees Versus Linear Models
  7. Bagging, Random Forests, Boosting

    1. Bagging
    2. Random Forests
    3. Boosting
  8. Support Vector Machines and Flexible disct

    1. Maximal Margin classifier
    2. Support vector classifiers
    3. Support vector machines
    4. 2 and more classes SVM’s
    5. Relationship to logistic regression
  9. Principal Components Analysis

  10. Clustering

    1. K-means clustering
    2. K-medoids clustering
    3. Hierarchical clustering
    4. Density based clustering
  11. Model Assesment and Selection

    1. Bias, Variance and Model complexity
    2. In-sample prediction error
    3. The Bayesian approach
    4. Cross-validation
    5. Bootstrap methods
 28 Hours

Number of participants



Price per participant

Testimonials (3)

Related Courses

Knowledge Discovery in Databases (KDD)

21 Hours

Introduction to Data Visualization with Tidyverse and R

7 Hours

Econometrics: Eviews and Risk Simulator

21 Hours

HR Analytics for Public Organisations

14 Hours

Statistical Analysis using SPSS

21 Hours

Talent Acquisition Analytics

14 Hours

Advanced R

7 Hours

Algorithmic Trading with Python and R

14 Hours

Anomaly Detection with Python and R

14 Hours

Programming with Big Data in R

21 Hours

R Fundamentals

21 Hours

Cluster Analysis with R and SAS

14 Hours

Data and Analytics - from the ground up

42 Hours

Data Analytics With R

21 Hours

Data Mining with R

14 Hours

Related Categories

1