How ChatGPT Finds Information
Understanding how ChatGPT accesses information helps you optimize for citations.
Two Terms You'll See Throughout
This course uses two acronyms again and again, so let's define them up front:
- GEO (Generative Engine Optimization) — optimizing your content so AI assistants like ChatGPT and Claude surface, cite, and recommend it.
- SEO (Search Engine Optimization) — the long-standing practice of optimizing content to rank in traditional search engines like Google.
Two Knowledge Sources
1. Training Data (Parametric Knowledge)
ChatGPT has a knowledge cutoff date. Information encoded during training:
- Web content up to the cutoff
- Books, articles, publications
- Code repositories
For GEO: Your content must be high-quality enough to influence training data patterns.
2. Web Browsing (Non-Parametric Knowledge)
ChatGPT Plus and Enterprise can browse the web:
- Real-time search for current information
- Retrieves and cites web pages
- Provides links to sources
For GEO: Optimize for both training data AND real-time searchability.
The ChatGPT Citation Process
When ChatGPT browses:
- Query formulation: Converts user question to search queries
- Search: Retrieves top search results
- Evaluation: Assesses content relevance and quality
- Extraction: Pulls relevant information
- Citation: Includes source links in response
What Gets Cited
ChatGPT prioritizes:
- Content that ranks well in search
- Pages with clear, direct answers
- Authoritative sources
- Current, accurate information
- Specific facts over vague claims
What Doesn't Get Cited
ChatGPT avoids:
- Low-quality or thin content
- Outdated information
- Vague marketing language
- Content that contradicts reliable sources
Key Takeaway
For ChatGPT with browsing: you need strong SEO (to be found) AND strong GEO (to be cited once found). It's a two-step process.
Continue learning: Next up - how Claude's approach differs.

