eudr_chabo_orchestrator

Running on CPU Upgrade

App Files Files Community

mtyrrell commited on Oct 1

Commit

ef736ff

1 Parent(s): c8f5440

cleanup

Browse files

Files changed (2) hide show

README.md +37 -7
params.cfg +1 -1

README.md CHANGED Viewed

@@ -148,28 +148,58 @@ Abstraction layer for managing different retriever configurations:
 Helper functions
 ## Configuration
 ### Configuration File (`params.cfg`)
 ```ini
 [file_processing]
-# Direct output mode: return ingestor results immediately
 DIRECT_OUTPUT = True
 [retriever]
-RETRIEVER = https://your-retriever.hf.space/
-COLLECTION_NAME = YOUR_COLLECTION
 [generator]
-GENERATOR = https://your-generator.hf.space
 [ingestor]
-INGESTOR = https://your-ingestor.hf.space
 [general]
-# Context limit for LLM (tokens ~= chars/4)
-MAX_CONTEXT_CHARS = 15000
 ```
 ### Environment Variables

 Helper functions
+#### Conversation Context Management
+The `build_conversation_context()` function manages conversation history to provide relevant context to the generator while respecting token limits and conversation flow.
+**Key Features:**
+- **Context Selection**: Always includes the first user and assistant messages to maintain conversation context
+- **Recent Turn Limiting**: Includes only the last N complete turns (user + assistant pairs) to focus on recent conversation (default: 3)
+- **Character Limit Management**: Truncates to maximum character limits to prevent context overflow
+**Function Parameters:**
+```python
+def build_conversation_context(
+    messages,           # List of Message objects from conversation
+    max_turns: int = 3, # Maximum number of recent turns to include
+    max_chars: int = 8000  # Maximum total characters in context
+) -> str
+```
 ## Configuration
 ### Configuration File (`params.cfg`)
 ```ini
 [file_processing]
+# Enable direct output mode: when True, ingestor results are returned directly
+# without going through the generator. When False, all files go through full RAG pipeline.
+# This also prevents ChatUI from resending the file in the conversation history with each turn
+# Note: File type validation is handled by the ChatUI frontend
 DIRECT_OUTPUT = True
+[conversation_history]
+# Limit the context window for the conversation history
+MAX_TURNS = 3
+MAX_CHARS = 12000
 [retriever]
+RETRIEVER = https://giz-chatfed-retriever0-3.hf.space/
+# Optional
+COLLECTION_NAME = EUDR
 [generator]
+GENERATOR = https://giz-eudr-chabo-generator.hf.space
 [ingestor]
+INGESTOR = https://giz-eudr-chabo-ingestor.hf.space
 [general]
+# need to include this for HF inference endpoint limits
+MAX_CONTEXT_CHARS = 15000
 ```
 ### Environment Variables

params.cfg CHANGED Viewed

@@ -8,7 +8,7 @@ DIRECT_OUTPUT = True
 [conversation_history]
 # Limit the context window for the conversation history
 MAX_TURNS = 3
-MAX_CHARS = 8000
 [retriever]
 RETRIEVER = https://giz-chatfed-retriever0-3.hf.space/

 [conversation_history]
 # Limit the context window for the conversation history
 MAX_TURNS = 3
+MAX_CHARS = 12000
 [retriever]
 RETRIEVER = https://giz-chatfed-retriever0-3.hf.space/