CO3519 Artificial Intelligence
CO3519 Lecture 2 - AI Types & Areas in AI
Lecture Documents¶
Written Notes¶



Learning Objectives¶
- Machine Learning (ML)
- Types of Machine Learning
- Relation between ML & AI
- Key steps in developing ML model
Machine Learning¶
What is Machine Learning¶
Quote
Subfield of AI that focuses on giving machines the ability to learn and make decisions from data without being explicitly programmed.
Types of Machine Learning¶
- Supervised Learning
- Unsupervised Learning
- Reinforcement Learning
Supervised Learning¶
Supervised learning the training of a ML model on labelled datasets where both input data and the correct output (label) are known.
Key Details¶
The algorithm learns by mapping inputs to their corresponding outputs.
- The algorithm learns by mapping inputs to their corresponding outputs.
- Goal is to predict labels for new unseen data.
- Performance is measured using the metrics:
- Accuracy
- Precision
- Recall
- F1-Score
Common Algorithms¶
- Linear Regression - Predicts continuous values.
- Logistic Regression - Used for binary classification.
- Support Vector Machines (SVM) - Finds the optimal separating hyperplane.
- Decision Trees - Tree-like models for classification or regression.
Example Applications
- Image Classification (e.g. cat vs dog, mouse vs rat)
- Spam Email Detection
- Sentiment Analysis
- Disease Diagnosis
Lecture Example¶

A database containing both cat and dog images are separated and labelled before being processed into a trained model which identify which are cat or dog by their assigned label.
Unsupervised Learning¶
Unsupervised learning trains a model on unlabelled data, meaning the correct outputs are unknown.
Key Details¶
- The algorithm discovers hidden patterns or structures in data.
- Goal is to group or reduce data dimensions without prior labels.
- Performance is often evaluated using internal metrics (e.g. Silhouette Score, Davies–Bouldin Index).
Common Algorithms¶
- K-Means Clustering – Groups data into k clusters based on similarity.
- Hierarchical Clustering – Builds a tree of clusters.
- Principal Component Analysis (PCA) – Reduces dimensionality while preserving variance.
- Association Rule Learning (Apriori) – Finds relationships between variables in large datasets.
Example Applications
- Customer Segmentation
- Market Basket Analysis
- Anomaly Detection
- Topic Modeling
Reinforcement Learning¶
Reinforcement learning (RL) trains an agent to make successive decisions by interacting with an environment to maximise rewards and minimise punishment.
Key Details¶
- No labelled data — learning via trial and error.
- The agent observes a state, performs an action, and receives a reward.
- Goal is to learn a policy that maximizes total reward over time.
- Evaluated by convergence speed and long-term reward.
Common Algorithms¶
- Q-Learning – Learns value of actions using Q-values.
- Deep Q-Network (DQN) – Combines Q-learning with neural networks.
- Policy Gradient Methods – Directly optimize the policy function.
- SARSA – On-policy version of Q-learning.
Example Applications
-
Game AI (e.g. AlphaGo, Chess, Dota 2 bots)
-
Autonomous Driving
-
Robotics Control
-
Dynamic Pricing
Lecture Example¶
Positive reinforcement on a chicken.
Agent - Chicken
Action - Touching PINK circle.
Reward - corn grain
Key Steps in Developing an ML Model¶
flowchart TD
A[Problem Definition • Clearly define the task you are solving] -->
B[Data Collection • Collect data for training/testing] -->
C[Data Pre-processing • Clean and transform data] -->
D[Features Extraction • Transform data into measurable properties] -->
E[Model Selection • Choose algorithm based on problem type] -->
F[Model Training / Evaluation • Use training data to optimize parameters]
classDef default fill:#007ACC,color:#ffffff,stroke:#333,stroke-width:1px; Problem Definition Examples
- Predicting House Prices
- Classify Emails
- Detect Suspicious Object
Data Collection Examples
- Housing datasets from public records or APIs
- Image datasets (e.g. ImageNet, CIFAR-10)
- Official medical datasets from research institutions
Data Pre-Processing Examples
- Numerical Data: Principal Component Analysis (PCA), statistical normalization
- Text Data: Bag of Words, Word embeddings (Word2Vec), TF-IDF
- Image Data: Edge Detection, Histogram of Oriented Gradients (HOG), Convolutional Neural Networks (CNN)
Feature Extraction Examples
- Using PCA for dimensionality reduction
- Extracting statistical features from signals
- Deriving deep feature maps with CNN layers
Model Selection Examples
- Linear Regression for continuous value prediction
- Logistic Regression for binary classification
- Decision Trees or Random Forests for mixed-type data
- Support Vector Machines (SVM) for clear class separation
Model Training & Evaluation Examples
- Splitting data into training (70%), validation (15%), and testing (15%) sets
- Evaluating models with Accuracy, Precision, Recall, and F1-Score
- Testing a spam detector on new email samples
Lab Activity¶
Solve house price prediction problem using LR.
Next Week¶
- Key Challenges in Machine Learning
- Supervised Learning
- Linear Regression
- Logistic Regression
- Support Vector Machines (SVM)
- Decision Trees