🌱 Beginner — AI Fundamentals

Chapter 8 of 24

🧹 Chapter 8: Normalization (Cleaning the Mess)

Making text consistent for the model

To a human, "The café is open.", "THE CAFE IS OPEN!!!", and "the café is open" all mean the same thing. To a computer, they are three completely different strings.

What normalization does

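Normalization cleans text so that equivalent inputs end up looking the same: it lowercases everything, removes extra punctuation, and standardizes accents (for example, "café" becomes "cafe").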

Before

THE CAFE IS OPEN!!!

After

the cafe is open
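
The Before/After pair above can be reproduced with a few lines of Python. This is a minimal sketch, not a standard recipe: the function name normalize is made up for illustration, and it strips all punctuation rather than only "extra" punctuation, which is a simplifying assumption.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Hypothetical helper: lowercase, strip accents, drop punctuation."""
    # 1. Lowercase everything.
    text = text.lower()
    # 2. Standardize accents: split "é" into "e" + a combining mark,
    #    then drop the combining marks.
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # 3. Remove punctuation (assumption: remove all of it, not just repeats).
    text = re.sub(r"[^\w\s]", "", text)
    # 4. Collapse runs of whitespace and trim the ends.
    return re.sub(r"\s+", " ", text).strip()


for raw in ["The café is open.", "THE CAFE IS OPEN!!!", "the café is open"]:
    print(normalize(raw))
```

All three inputs from the start of the chapter come out as the same string, which is the whole point of normalization.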

⚖ Tradeoff

More cleaning: faster training, smaller models
Less cleaning: more nuance, richer context
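
To make the tradeoff concrete, here is a small sketch that counts how many distinct words a model would have to handle before and after cleaning. The word list and the helper name simple_normalize are invented for illustration, and the helper only lowercases and strips accents.

```python
import unicodedata


def simple_normalize(word: str) -> str:
    """Hypothetical helper: lowercase and strip accents only."""
    decomposed = unicodedata.normalize("NFKD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Made-up sample words, chosen to show both sides of the tradeoff.
words = ["Apple", "apple", "APPLE", "résumé", "resume", "US", "us"]

raw_vocab = set(words)
clean_vocab = {simple_normalize(w) for w in words}

print(len(raw_vocab))    # 7 distinct strings before cleaning
print(len(clean_vocab))  # 3 distinct strings after cleaning
```

Fewer distinct words means less for the model to learn, but the nuance is gone too: after cleaning, "US" (the country) and "us" (the pronoun) are the same word, and so are "résumé" and "resume".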