Lesson 7.1: Project Overview and Requirements
System Requirements
Functional:
- Support 10k concurrent users
- Sub-100ms query latency for features
- Store 1M+ documents for RAG
- Log all model predictions
- Track user conversations
- Real-time analytics dashboard
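The sub-100ms target above only becomes checkable once a percentile is chosen. A minimal sketch, assuming the budget is judged at the 95th percentile of measured query times. The requirements do not name a percentile, so p95 and the helper names are illustrative assumptions.

```python
import math

LATENCY_BUDGET_MS = 100.0  # from the "sub-100ms" requirement


def percentile(samples: list[float], pct: float) -> float:
    """Nearest-rank percentile: smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("need at least one latency sample")
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def within_budget(samples: list[float], pct: float = 95.0,
                  budget_ms: float = LATENCY_BUDGET_MS) -> bool:
    """True when the chosen percentile is strictly under the budget."""
    return percentile(samples, pct) < budget_ms
```

Feeding this the per-query timings from a load test gives a pass/fail signal for the latency requirement. A stricter percentile such as p99 may suit user-facing paths better.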
Non-Functional:
- ACID transactions for critical data
- Horizontal read scaling (replicas)
- Partitioning for time-series data
- Indexing that keeps all queries under 100ms
- Data retention policies
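The partitioning and retention requirements above can be handled together, since dropping a whole partition is usually cheaper than deleting rows one by one. A minimal sketch, assuming a parent table named predictions that is range-partitioned on a timestamp column, with monthly partitions named like predictions_2024_01. The helper names and the retention window are illustrative assumptions, and the code only builds SQL strings rather than executing them.

```python
from datetime import date


def month_index(d: date) -> int:
    """Months since year 0, so month arithmetic is plain integer math."""
    return d.year * 12 + (d.month - 1)


def month_from_index(idx: int) -> date:
    """First day of the month identified by a month index."""
    return date(idx // 12, idx % 12 + 1, 1)


def partition_name(month: date) -> str:
    """Name of the monthly partition that covers the given date."""
    return f"predictions_{month.year:04d}_{month.month:02d}"


def create_partition_sql(month: date) -> str:
    """DDL for one monthly range partition (upper bound is exclusive)."""
    start = month_from_index(month_index(month))
    end = month_from_index(month_index(month) + 1)
    return (
        f"CREATE TABLE IF NOT EXISTS {partition_name(start)} "
        f"PARTITION OF predictions "
        f"FOR VALUES FROM ('{start.isoformat()}') TO ('{end.isoformat()}');"
    )


def expired_partitions(existing: list[date], today: date, keep_months: int) -> list[str]:
    """Partitions outside the window of the current month plus keep_months - 1 earlier months."""
    now = month_index(today)
    return [
        partition_name(m)
        for m in sorted(existing)
        if now - month_index(m) >= keep_months
    ]


def drop_partition_sql(name: str) -> str:
    """Retention by dropping the whole partition instead of deleting rows."""
    return f"DROP TABLE IF EXISTS {name};"
```

A scheduled job could call create_partition_sql for the upcoming month and drop_partition_sql for each name returned by expired_partitions.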
Architecture Diagram
User Query
     ↓
[API Layer]
     ↓
┌─────────────────────────────────────┐
│ PostgreSQL (Primary)                │
│                                     │
│  ┌──────────────────────────────┐   │
│  │ RAG System                   │   │
│  │ - documents                  │   │
│  │ - chunks (with embeddings)   │   │
│  │ - permissions                │   │
│  └──────────────────────────────┘   │
│                                     │
│  ┌──────────────────────────────┐   │
│  │ Feature Store                │   │
│  │ - user_features (cached)     │   │
│  └──────────────────────────────┘   │
│                                     │
│  ┌──────────────────────────────┐   │
│  │ Conversation Memory          │   │
│  │ - conversations              │   │
│  │ - messages                   │   │
│  │ - agent_entities             │   │
│  └──────────────────────────────┘   │
│                                     │
│  ┌──────────────────────────────┐   │
│  │ Model Logs (Partitioned)     │   │
│  │ - predictions_2024_01        │   │
│  │ - predictions_2024_02        │   │
│  └──────────────────────────────┘   │
└─────────────────────────────────────┘
     │
     ↓
[Read Replicas]
- Analytics
- Dashboard
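The split between the primary and the read replicas can be expressed as a small routing rule in the API layer. A minimal sketch, assuming analytics and dashboard reads tolerate slightly stale data from a replica while everything else goes to the primary. The Workload enum, choose_dsn, and the connection strings are hypothetical names, not part of the project.

```python
from enum import Enum


class Workload(Enum):
    TRANSACTIONAL = "transactional"  # writes and reads that must see the latest data
    ANALYTICS = "analytics"          # served by read replicas in the diagram
    DASHBOARD = "dashboard"          # served by read replicas in the diagram


# Hypothetical connection strings, for illustration only.
PRIMARY_DSN = "postgresql://primary.example.internal/support"
REPLICA_DSNS = [
    "postgresql://replica-1.example.internal/support",
    "postgresql://replica-2.example.internal/support",
]


def choose_dsn(workload: Workload, request_id: int,
               primary: str = PRIMARY_DSN,
               replicas: list[str] = REPLICA_DSNS) -> str:
    """Send replica-safe reads to a replica, everything else to the primary."""
    if workload is Workload.TRANSACTIONAL or not replicas:
        return primary
    # Spread replica traffic deterministically across the available replicas.
    return replicas[request_id % len(replicas)]
```

Replicas may lag the primary, so any read that has to reflect a write made moments earlier is better classified as TRANSACTIONAL.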
Key Takeaways
- Capstone project builds a complete AI-powered customer support system
- Four main components: RAG system, feature store, conversation memory, model logs
- Functional requirements focus on scale (10k concurrent users, 1M+ documents)
- Feature queries must return in under 100ms
- Architecture uses a PostgreSQL primary plus read replicas, which scale reads horizontally for analytics and the dashboard
- Model predictions are time-series data, so their log table is partitioned by month
- Production readiness includes ACID transactions, proper indexing, and retention policies

