Spaces:

Agents-MCP-Hackathon
/

MedCodeMCP

Sleeping

App Files Files Community

MedCodeMCP / README.md

gpaasch

MedCodeMCP is clearer. Capitalizing the “C” separates “Med” and “Code” and avoids the run-together feel of MedcodeMCP

04d1e41 7 months ago

preview code

raw

history blame

6.65 kB

	---
	title: MedCodeMCP
	emoji: 💬
	colorFrom: yellow
	colorTo: purple
	sdk: gradio
	sdk_version: 5.32.1
	app_file: app.py
	pinned: false
	license: apache-2.0
	short_description: an MCP Tool for Symptom-to-ICD Diagnosis Mapping.
	---
	A chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and my local RTX 2060 instead of Cloud APIs

	# MedCodeMCP – an MCP Tool for Symptom-to-ICD Diagnosis Mapping

	## MVP Scope
	- Accept a patient’s symptom description (free-text input).
	- Output a structured JSON with a list of probable diagnoses, each including:
	- ICD-10 code
	- Diagnosis name
	- Confidence score
	- Handle a subset of common symptoms and return the top 3–5 likely diagnoses.

	## How It Works

	### Input Interface
	- Gradio-based demo UI for testing:
	- Single text box for symptoms (e.g., “chest pain and shortness of breath”).
	- Primary interface is programmatic (MCP client calls the server).

	### Processing Logic
	- Leverage an LLM (e.g., OpenAI GPT-4 or Anthropic Claude) to parse symptoms and suggest diagnoses.
	- Prompt example:
	> “The patient reports: {symptoms}. Provide a JSON list of up to 5 possible diagnoses, each with an ICD-10 code and a confidence score between 0 and 1. Use official ICD-10 names and codes.”
	- Recent experiments with medical foundation models (e.g., Google’s Med-PaLM/MedGEMMA) show they can identify relevant diagnosis codes via prompt-based reasoning ([medium.com](https://medium.com)).
	- Using GPT-4/Claude in the loop ensures rapid development and high-quality suggestions ([publish0x.com](https://publish0x.com)).

	### Confidence Scoring
	- Instruct the LLM to assign a subjective probability (0–1) for each diagnosis.
	- Accept approximate confidences for MVP.
	- Alternative: rank by output order (first = highest confidence).

	### ICD-10 Code Mapping
	- Trust LLM’s knowledge of common ICD-10 codes (e.g., chest pain → R07.9, heart attack → I21.x).
	- Sanity-check:
	- Maintain a small dictionary of common ICD-10 codes.
	- Use regex to verify code format.
	- Flag or adjust codes that don’t match known patterns.
	- Future improvement: integrate a full ICD-10 lookup list for validation.

	### Alternate Approach
	- Use an open model fine-tuned for ICD coding (e.g., Clinical BERT on Hugging Face) to predict top ICD-10 codes from clinical text.
	- Requires more coding and possibly a GPU, but feasible.
	- For hackathon MVP, prioritize API-based approach with GPT/Claude ([huggingface.co](https://huggingface.co)).

	### Output Format
	- JSON structure for easy agent parsing. Example:
	```json
	{
	"diagnoses": [
	{
	"icd_code": "I20.0",
	"diagnosis": "Unstable angina",
	"confidence": 0.85
	},
	{
	"icd_code": "J18.9",
	"diagnosis": "Pneumonia, unspecified organism",
	"confidence": 0.60
	}
	]
	}
	````

	* Input: “chest pain and shortness of breath”
	* Output: Cardiac-related issues (e.g., angina/MI) and respiratory causes, each with confidence estimates.
	* Structured output aligns with MCP tool requirements for downstream agent reasoning.

	## Gradio MCP Integration

	* Implement logic in `app.py` of a Gradio Space.
	* Tag README with `mcp-server-track` as required by hackathon.
	* Follow “Building an MCP Server with Gradio” guide:

	* Use Gradio SDK 5.x.
	* Define a tool function with metadata for agent discovery.
	* Expose a prediction endpoint.

	### Example Gradio Definition (simplified)

	```python
	import gradio as gr
	import openai
	def symptom_to_diagnosis(symptoms: str) -> dict:
	prompt = f"""The patient reports: {symptoms}. Provide a JSON list of up to 5 possible diagnoses, each with an ICD-10 code and a confidence score between 0 and 1. Use official ICD-10 names and codes."""
	response = openai.ChatCompletion.create(
	model="gpt-4",
	messages=[{"role": "system", "content": prompt}],
	temperature=0.2
	)
	# Parse response content as JSON
	return response.choices[0].message.content
	demo = gr.Interface(
	fn=symptom_to_diagnosis,
	inputs=gr.Textbox(placeholder="Enter symptoms here..."),
	outputs=gr.JSON(),
	title="MedCodeMCP Symptom-to-ICD Mapper",
	)
	demo.launch()
	```

	* Ensure MCP metadata is included so an external agent can discover and call `symptom_to_diagnosis`.

	## User Demo (Client App)

	* Create a separate Gradio Space or local script that:

	* Calls the MCP server endpoint.
	* Renders JSON result in a user-friendly format.
	* Optionally record a video demonstration:

	* Show an agent (e.g., Claude-2 chatbot) calling the MCP tool.
	* Verify end-to-end functionality.

	## MVP Development Steps

	1. Set Up Gradio Space

	* Initialize a new Hugging Face Space with Gradio SDK 5.x.
	* Tag README with `mcp-server-track`.

	2. Implement Symptom-to-Diagnosis Function

	* Write a Python function to:

	* Accept symptom text.
	* Call GPT-4/Claude API with JSON-output prompt.
	* Parse the model’s JSON response into a Python dictionary.
	* Sanitize and validate JSON output.
	* Fallback: rule-based approach or offline model for demo cases if API limits are reached.

	3. Testing

	* Input various symptom combinations.
	* Verify sensibility of diagnoses and correctness of ICD-10 codes.
	* Tweak prompt to improve specificity.
	* Ensure JSON structure is valid.

	4. Confidence Calibration

	* Define how confidence scores are assigned:

	* Use LLM’s self-reported confidences, or
	* Rank by output order.
	* Document confidence methodology in README.

	5. Integrate with Gradio Blocks

	* Wrap the function in a Gradio interface (`gr.Interface` or `gr.ChatInterface`).
	* Expose function as an MCP tool with appropriate metadata.
	* Test via `gradio.Client` or HTTP requests.

	6. Build a Quick Client (Optional)

	* Option A: Second Gradio Space as MCP client showing how an LLM calls the tool.
	* Option B: Local script using `requests` to call the deployed Space’s prediction API.
	* Prepare a screen recording illustrating agent invocation.

	7. Polish Documentation

	* In README:

	* Explain tool functionality and usage.
	* Include hackathon requirements: track tag, demo video or client link.
	* List technologies used (e.g., “Uses OpenAI GPT-4 via API to map symptoms to diagnoses”).
	* Provide example usage and sample inputs/outputs.

	By completing these steps, the MVP will demonstrate end-to-end functionality: input symptoms → structured diagnostic insights with ICD-10 codes and confidence scores via an MCP server.