Library: Northwestern AI in Education: Generative AI

What is generative AI?

"Artificial intelligence that is capable of generating new content (such as images or text) in response to a submitted prompt by learning from a large reference database of examples."-dictionary definition from Merriam-Webster

"Generative AI refers to artificial intelligence systems that can create new content, such as text, images, music, or code, based on patterns learned from existing data. These models use machine learning techniques, particularly deep learning, to generate outputs that mimic human creativity and intelligence."-definition generated by ChatGPT

How does generative AI work?

Generative AI relies on advanced machine learning models to create new content, whether it’s text, images, music, or even video. These systems learn patterns from vast datasets and use probability-based predictions to generate outputs. Below is a step-by-step breakdown of how generative AI works:

Training Data & Machine Learning Models

Generative AI models are built using large-scale datasets that help them learn how to generate content. The type of data varies depending on the AI’s purpose:

Text Models (e.g., ChatGPT, Claude) are trained on books, articles, and websites.
Image Models (e.g., DALL·E, MidJourney) are trained on photographs, paintings, and digital artwork.
Music Models (e.g., Jukebox, Soundraw) analyze melodies, chord progressions, and lyrics from existing songs.

To process this data, AI models use deep learning, a subset of machine learning inspired by how the human brain works. These models are built using artificial neural networks, which identify patterns and structures within the data.

Neural Networks & Pattern Recognition

At the heart of generative AI is the neural network, a system designed to recognize relationships in data and generate outputs based on learned patterns.

How it works in simple terms:

The AI model scans millions (or billions) of examples.
It identifies patterns, styles, and relationships within the data.
Using statistical probability, it predicts the most likely next word, pixel, or sound based on what it has learned.

For example, if you ask a text-based AI to write a story, it doesn’t retrieve a pre-written response. Instead, it predicts the most likely sequence of words based on context and patterns learned during training.

How AI Generates Content

Once trained, generative AI can produce content through a process called sampling—essentially, making educated guesses based on patterns.

For Text Generation (e.g., ChatGPT):

The model receives a prompt (e.g., “Write a poem about space”).
It breaks the request down into components (space-related words, poetic structure).
It generates text one word at a time by predicting what logically comes next.
The result is a cohesive response based on what it learned from training data.

For Image Generation (e.g., DALL·E, MidJourney):

The model receives a text prompt (e.g., “A futuristic city at sunset”).
It converts the text into a set of mathematical instructions.
It uses diffusion models to generate images pixel by pixel based on style and composition learned from training data.

For Music Generation (e.g., Jukebox, Soundraw):

The AI analyzes existing music for melody, rhythm, and structure.
It generates new compositions by blending learned patterns to create original pieces.

Fine-Tuning & Reinforcement Learning

Once an AI model is trained, it undergoes fine-tuning to improve accuracy, reduce biases, and align with user needs.

Reinforcement Learning from Human Feedback (RLHF): Developers train AI using human reviewers to reward good responses and penalize inaccurate or biased outputs.
Bias Mitigation: AI models are refined to remove harmful or misleading patterns picked up during training.
Customization: Some AI models allow users to train them on specific datasets for specialized purposes (e.g., legal research, medical diagnoses).

Limitations & Challenges

While generative AI is powerful, it’s not perfect. Some common challenges include:

AI Hallucinations – AI sometimes makes up information or generates incorrect responses.
Bias in Training Data – If the data used to train the model has biases, the AI may produce biased results.
Lack of True Understanding – AI doesn’t think or comprehend like humans—it merely predicts patterns.
Computational Costs – Running large AI models requires high processing power, making them expensive to train and maintain. Sustainability is a major concern when it comes to large-scale usage of AI.

Northwestern Campus Library

Northwestern AI in Education