💬 Join the DecodeAI WhatsApp Channel for more AI updates → Click here

[Day 3] Machine Learning (ML) and Neural Networks (NN)

Learn how Machine Learning teaches computers to learn from data and how Neural Networks mimic the human brain to power AI like face recognition and chatbots.

What is Machine Learning?

Image source: https://www.ibm.com/think/topics/artificial-intelligence

Machine learning (ML) is a branch of artificial intelligence (AI) that enables computers to learn and make decisions or predictions without being explicitly programmed. Instead of following a rigid set of rules, ML models identify patterns in data and use those patterns to make informed decisions.

At its core, machine learning is the science of enabling machines to 'improve their performance over time through experience'. It’s like teaching a computer how to learn from data, much like humans learn from experience.

For example:

  • If you want a computer to recognize a cat in a picture, instead of defining every possible detail of what a cat looks like, you feed it thousands of pictures labeled "cat" and "not cat." The computer learns to differentiate based on patterns in the data.

How Does Machine Learning Work?

Machine learning operates through a combination of data, algorithms, and models. Here’s a simplified explanation:

  1. Data Collection:
    The process starts with gathering relevant data. This data is usually large in volume and can include text, images, numbers, or other formats.
  2. Data Preparation:
    Raw data is often messy. It needs cleaning (removing errors or irrelevant information) and structuring to ensure quality input for the ML model.
  3. Choosing an Algorithm:
    Algorithms are the mathematical recipes used by ML to process data. Popular algorithms include decision trees, neural networks, and clustering techniques.
    We will cover each of these in detail over the next few days.
  4. Training the Model:
    The algorithm is applied to the data during the training phase. The model learns the patterns and relationships within the dataset.
  5. Testing the Model:
    After training, the model is tested with new, unseen data to evaluate its performance and accuracy.
  6. Making Predictions:
    Once trained and tested, the model can make predictions or decisions when provided with new input data.
  7. Improvement Over Time:
    Feedback loops help the model learn and improve. With more data and fine-tuning, accuracy and reliability often increase.
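The seven steps above can be sketched end to end in a few lines of Python. Everything here is made up for illustration: the "model" is just a single height threshold, chosen to minimize mistakes on the training set.

```python
import random

# 1. Data collection: a made-up dataset of heights, labeled "tall" (1) or not (0).
random.seed(42)
heights = [random.randint(150, 195) for _ in range(40)]
data = [(h, 1 if h >= 172 else 0) for h in heights]

# 2. Data preparation: shuffle, then split into training and test sets.
random.shuffle(data)
train, test = data[:30], data[30:]

def errors(threshold, rows):
    # Count how often "height >= threshold" disagrees with the true label.
    return sum((h >= threshold) != bool(label) for h, label in rows)

# 3-4. Algorithm + training: scan candidate thresholds and keep the one
# with the fewest mistakes on the training data.
best = min(range(150, 196), key=lambda t: errors(t, train))

# 5-6. Testing + prediction: measure accuracy on the held-out, unseen examples.
accuracy = 1 - errors(best, test) / len(test)
print(f"learned threshold: {best}, test accuracy: {accuracy:.2f}")
```

Real models are far more sophisticated, but the pipeline (collect, prepare, train, test, predict) is the same shape.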

We will build a few small Python-based projects in the coming days to help you understand these ideas better.


Key Ingredients for Machine Learning

  • Data: High-quality, diverse data is essential for effective learning.
  • Computing Power: Training models, especially large ones, requires significant computational resources.
  • Algorithms: The choice of algorithm depends on the problem you’re solving (e.g., classification, regression, clustering).

Types of Machine Learning

Machine learning (ML) is often categorized into three main types: Supervised Learning, Unsupervised Learning, and Reinforcement Learning. These categories are defined by how the model learns from the data and the kind of problem it aims to solve.

1. Supervised Learning

Supervised learning involves teaching a model using labeled data, where each data point has an input (features) and a corresponding output (label). The model learns to map the inputs to the outputs by finding patterns in the labeled data.

How It Works:

  • Training: The model is trained on a dataset with known inputs and outputs (e.g., “features” like height and weight, and “label” like “healthy” or “not healthy”).
  • Prediction: Once trained, the model can predict the output for new, unseen inputs.
  • Error Correction: The model evaluates its predictions, adjusts its parameters, and minimizes errors over time.

Example:

  • Email Spam Detection: The model learns to classify emails as "spam" or "not spam" based on features like the presence of certain keywords or the sender's reputation.
  • Loan Approval: A bank uses customer data (age, income, credit score) to predict whether a loan should be approved.
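Here is a minimal sketch of supervised learning in the spirit of the spam example, using a tiny made-up dataset and a perceptron-style update rule (one classic way to do the "error correction" step):

```python
# Each example: ([contains "free", contains "winner", known sender], label)
# where label 1 = spam, 0 = not spam. All data is invented for illustration.
examples = [
    ([1, 1, 0], 1),  # "free" + "winner", unknown sender -> spam
    ([1, 0, 0], 1),
    ([0, 1, 0], 1),
    ([0, 0, 1], 0),  # known sender, no spammy words -> not spam
    ([1, 0, 1], 0),
    ([0, 0, 0], 0),
]

weights, bias = [0.0, 0.0, 0.0], 0.0

def predict(x):
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if score > 0 else 0

# Training with error correction: on every mistake, nudge the weights
# toward the correct label.
for _ in range(10):
    for x, label in examples:
        error = label - predict(x)
        if error:
            weights = [w + error * xi for w, xi in zip(weights, x)]
            bias += error

print(predict([1, 1, 0]))  # 1: a spammy-looking email
print(predict([0, 0, 1]))  # 0: an email from a known sender
```

Note how the model maps inputs to outputs only because it saw labeled examples of both classes.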

2. Unsupervised Learning

Unsupervised learning deals with unlabeled data, where the model isn’t given explicit outputs to predict. Instead, it identifies hidden patterns, relationships, or structures in the data.

How It Works:

  • Clustering: Groups similar data points into clusters based on their features.
  • Dimensionality Reduction: Simplifies complex datasets by reducing the number of variables while retaining important information.

Example:

  • Customer Segmentation: In marketing, an unsupervised model can group customers based on purchasing behavior to identify distinct customer segments.
  • Anomaly Detection: Detecting unusual activity, such as fraud in a banking system, by recognizing outliers in data.
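Clustering can be sketched with a bare-bones k-means loop. The spending figures below are invented; the point is that no labels are supplied, yet the loop discovers groups on its own:

```python
# Made-up yearly spend (in hundreds) for ten customers, with no labels.
spend = [12, 15, 14, 13, 90, 95, 88, 92, 50, 52]

# Start with two guessed cluster centers and refine them iteratively.
centers = [0.0, 100.0]
for _ in range(10):
    clusters = [[], []]
    for s in spend:
        # Assign each customer to the nearest center.
        nearest = min(range(2), key=lambda i: abs(s - centers[i]))
        clusters[nearest].append(s)
    # Move each center to the mean of its assigned customers.
    centers = [sum(c) / len(c) for c in clusters]

print(sorted(centers))  # two segment centers: lower and higher spenders
```

Real segmentation would use many features per customer (not one number) and a library implementation, but the assign-then-update loop is the core idea.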

3. Reinforcement Learning

Reinforcement learning (RL) is inspired by behavioral psychology, where an agent learns by interacting with its environment. The agent takes actions and receives rewards or penalties, learning to maximize cumulative rewards over time.

How It Works:

  • Agent and Environment: The agent (e.g., a robot or program) interacts with the environment (e.g., a game or a physical world).
  • Rewards and Punishments: The agent gets feedback based on the outcomes of its actions.
  • Exploration and Exploitation: The agent explores different strategies while exploiting learned knowledge to improve performance.

Example:

  • Game Playing: AlphaGo, a reinforcement learning system, became a world champion in the game of Go by learning optimal strategies through millions of games.
  • Self-Driving Cars: The car (agent) learns to navigate roads (environment) by optimizing its actions (steering, braking, accelerating) based on rewards (safety and efficiency).
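A full RL system is beyond a few lines, but the explore/exploit loop can be sketched with a two-armed bandit. The payout rates below are made up and hidden from the agent, which must discover the better arm purely from rewards:

```python
import random

random.seed(0)
true_rates = [0.3, 0.7]   # the environment's hidden payout rates (made up)
values = [0.0, 0.0]       # the agent's running reward estimates per arm
counts = [0, 0]

for step in range(2000):
    # Exploration vs. exploitation: usually pick the best-known arm,
    # occasionally try a random one.
    if random.random() < 0.1:
        arm = random.randrange(2)
    else:
        arm = max(range(2), key=lambda a: values[a])
    # The environment gives a reward based on the chosen action.
    reward = 1 if random.random() < true_rates[arm] else 0
    counts[arm] += 1
    # Update the running-average estimate for the chosen arm.
    values[arm] += (reward - values[arm]) / counts[arm]

print("estimated rates:", [round(v, 2) for v in values])
print("preferred arm:", values.index(max(values)))
```

Systems like AlphaGo add deep networks and planning on top, but the same feedback loop of action, reward, and updated estimates is underneath.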

What is a Neural Network?

Neural Networks (NNs) are computational systems designed to mimic the structure and functioning of the human brain, particularly in how they process and interpret information to make decisions or recognize patterns.


Neural Networks (NNs) are inspired by the human brain in how they process information. Here's how they reflect it:

Image source: https://www.geeksforgeeks.org/artificial-neural-networks-and-its-applications/
  1. Neurons:
    In the brain, neurons receive signals, process them, and pass them along. Similarly, in a neural network, artificial neurons (nodes) take input, process it, and send output to the next layer.
  2. Connections:
    In the brain, neurons are connected through synapses, with stronger connections for more important signals. In NNs, weights represent these connections, determining how important an input is for the output.
  3. Learning:
    The brain learns by adjusting neuron connections over time. Neural networks learn by adjusting weights and biases during training, using techniques like backpropagation.
  4. Layers of Processing:
    The brain processes information in hierarchical layers (e.g., raw sensory input → recognition → decision-making). Similarly, NNs have input, hidden, and output layers to extract and refine patterns step by step.
  5. Pattern Recognition:
    Both the brain and NNs excel at recognizing patterns, whether it’s faces in photos or trends in data.

While NNs are simplified models of the brain, they mimic its structure and functionality enough to perform complex tasks efficiently.

Neural networks are the backbone of many advanced AI applications, including image recognition, natural language processing, and autonomous vehicles.

How Does a Neural Network Work?

Neural networks process data in layers. Each layer applies mathematical transformations to the input data, gradually extracting higher-level features.

  1. Input Layer:
    • Receives raw data (e.g., an image, text, or numbers).
    • Each input is represented as a numerical value (features).
  2. Hidden Layers:
    • These are intermediate layers where the actual computation happens.
    • Each node (neuron) processes the input using a mathematical function (weighted sum + activation function) and passes the result to the next layer.
    • The number of hidden layers and neurons determines the network's complexity.
  3. Output Layer:
    • Produces the final output (e.g., classifying an image as a "cat" or "dog").
    • For regression tasks, it outputs a numerical value; for classification, it outputs probabilities or categories.
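The three layers above can be sketched as a single forward pass through a tiny network. All the weights and biases here are arbitrary made-up numbers, chosen only to show the mechanics:

```python
import math

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

def layer(inputs, weights, biases):
    # Each neuron: weighted sum of its inputs plus a bias, then the activation.
    return [sigmoid(sum(w * x for w, x in zip(ws, inputs)) + b)
            for ws, b in zip(weights, biases)]

x = [0.5, -1.0]                                   # input layer: raw features
hidden = layer(x, [[0.8, -0.2], [0.4, 0.9]], [0.1, -0.3])   # hidden layer
output = layer(hidden, [[1.2, -0.7]], [0.05])     # output layer
print(round(output[0], 3))                        # a value between 0 and 1
```

For a classification task, this output would be read as a probability; training (covered below) is what turns the arbitrary numbers into useful ones.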

Key Concepts in Neural Networks

1. Weights and Biases

Think of a neural network as a decision-making system:

  • Weights: These are like the importance levels for each input. For example, if you’re deciding whether to take an umbrella, the weight for "is it raining?" will be higher than "do I like carrying an umbrella?"
  • Biases: These act as a nudge or adjustment. They help the network make better decisions by adding flexibility to the process.

The network learns these weights and biases during training so it can make accurate predictions.
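The umbrella decision can be written as a weighted sum plus a bias. The numbers below are made up to match the intuition: rain matters a lot, personal preference only a little, and the bias encodes a slight reluctance to carry anything by default.

```python
# Inputs are 1 (yes) or 0 (no); weights encode how much each input matters.
inputs  = {"is_it_raining": 1, "like_carrying_umbrella": 0}
weights = {"is_it_raining": 3.0, "like_carrying_umbrella": 0.5}
bias = -1.0   # a nudge against carrying an umbrella for no reason

# The neuron's decision: weighted sum of inputs, plus the bias.
score = sum(weights[k] * v for k, v in inputs.items()) + bias
take_umbrella = score > 0
print(take_umbrella)  # True: the rain signal outweighs the reluctance
```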

2. Activation Function

In the real world, relationships between inputs and outputs aren’t always straightforward. For example:

  • Doubling your effort doesn’t always double your results.

An activation function adds a bit of "non-linearity" to the network. It helps the network understand more complex relationships, like how things aren’t always a straight line.

Common examples:

  • ReLU: Keeps positive values as they are and turns negative values into zero. It’s simple and fast.
  • Sigmoid: Squashes values between 0 and 1, often used when outputs need to represent probabilities.
  • Tanh: Similar to Sigmoid but squashes values between -1 and 1, useful when outputs can be negative.
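All three functions are short enough to write out directly (these are the standard formulas):

```python
import math

def relu(z):
    # Positive values pass through unchanged; negatives become zero.
    return max(0.0, z)

def sigmoid(z):
    # Squashes any value into the range (0, 1).
    return 1 / (1 + math.exp(-z))

def tanh(z):
    # Squashes any value into the range (-1, 1).
    return math.tanh(z)

for z in (-2.0, 0.0, 2.0):
    print(f"z={z:+.1f}  relu={relu(z):.3f}  "
          f"sigmoid={sigmoid(z):.3f}  tanh={tanh(z):.3f}")
```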

3. Training with Backpropagation

This is how the network improves itself, step by step.

  1. Initial Guess: The network starts with random weights and biases, so its first predictions are usually wrong.
  2. Error Calculation: The difference between the network’s guess and the actual answer (the error) is calculated.
  3. Learning from Mistakes: The network uses a process called backpropagation to figure out which weights and biases caused the error. It then adjusts them slightly to make better predictions next time.
  4. Repeat: This process is repeated for many rounds (iterations) until the network becomes very good at making predictions.
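Full backpropagation chains this idea through every layer, but the core loop (guess, measure the error, nudge the weights, repeat) can be sketched with a single weight. The "true" relationship y = 3x below is made up:

```python
# Made-up training data following y = 3 * x; the network must discover the 3.
data = [(1, 3), (2, 6), (3, 9)]

w = 0.0     # 1. Initial guess: random/zero, so predictions start out wrong.
lr = 0.01   # Learning rate: how big each adjustment is.

for step in range(200):
    # 2. Error calculation + 3. learning from mistakes: the gradient of the
    # mean squared error tells us which direction to adjust w.
    grad = sum(2 * (w * x - y) * x for x, y in data) / len(data)
    w -= lr * grad   # nudge w against the gradient
    # 4. Repeat for many rounds.

print(round(w, 3))  # converges close to 3.0, the true slope
```

This is gradient descent on one weight; backpropagation is the bookkeeping that computes such gradients for millions of weights across many layers at once.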

Analogy for Backpropagation

Imagine you’re learning to throw a dart at a bullseye:

  1. On your first throw, you miss the target.
  2. You measure how far off you were.
  3. Based on that, you adjust your aim slightly.
  4. After many tries, your throws get closer and closer to the bullseye.

That’s how backpropagation works: Adjusting the network’s aim (weights and biases) to minimize the error.

The easiest way to visualize a neural network is to watch this YouTube video:

Important Info:

Don’t stress about the math behind neural networks—it’s already figured out and widely used! Just focus on understanding the basics. We’ll explore each algorithm step by step with real-life examples, use cases, and small Python projects. So, Chill! 😊

💬 Join the DecodeAI WhatsApp Channel
Get AI guides, bite-sized tips & weekly updates delivered where it’s easiest – WhatsApp.
👉 Join Now