Core Concepts
MiniLLM consists of small, composable modules that can be swapped out as your application grows. The default configuration uses Gemini for text generation and embeddings.
Memory Manager
MemoryManager
stores a rolling summary of past conversations. It supports a Redis backend for persistence and an in-memory fallback for quick testing.
Retrievers
Two retrievers help fetch relevant documents:
VectorRetriever
uses FAISS for similarity search. It calls thegemini_embed
function to compute embeddings.GraphRetriever
performs Neo4j graph search and also relies on the Gemini embedding API for text similarity.
Orchestrator
SimpleOrchestrator
connects each step of the workflow in a directed acyclic graph. You can add nodes and edges to customise the execution order.
Together these pieces let you build durable, stateful agents that remember past context and retrieve domain knowledge when answering new questions.