NexaAI
/

Granite-4.0-h-350M-NPU-mobile

Model card Files Files and versions

nexaml commited on 22 days ago

Commit

f0a99db

·

verified ·

1 Parent(s): 5197186

Create README.md

Files changed (1) hide show

README.md +43 -0

README.md ADDED Viewed

	@@ -0,0 +1,43 @@

+---
+base_model:
+- ibm-granite/granite-4.0-h-350m
+---
+# Granite-4.0-h-350M
+<p align="center">
+  <img src="https://cdn-uploads.huggingface.co/production/uploads/6851901ea43b4824f79e27a9/vBAkkCukOQ3CHlT2GBwvI.png" width="350" height="350">
+</p>
+Run **Granite-4.0-h-350M** optimized for **Qualcomm Hexagon NPUs** with [NexaSDK](https://sdk.nexa.ai) on Android
+## Model Description
+**Granite-4.0-h-350M** is a 350-million-parameter transformer model from IBM’s Granite 4.0 family — designed for efficient inference, low-latency edge deployment, and instruction following at compact scale.
+It shares the same data quality, architecture design, and alignment pipeline as larger Granite 4.0 models but is optimized for lightweight environments where performance per watt and model size are critical.
+Built on the **Granite 4.0** foundation, this model continues IBM’s commitment to open, responsible AI, offering transparency and adaptability for developers, researchers, and embedded AI applications.
+## Features
+- **Compact yet capable**: Delivers high-quality generation and reasoning with just 350M parameters.
+- **Instruction-tuned**: Follows natural language instructions for diverse tasks.
+- **Low-latency performance**: Ideal for CPU, GPU, and NPU inference.
+- **Efficient deployment**: Runs smoothly on edge and resource-constrained devices.
+- **Open and transparent**: Released under IBM’s open model governance framework.
+## Use Cases
+- On-device assistants and chatbots
+- Edge AI and IoT inference
+- Document and text summarization
+- Education and lightweight reasoning tasks
+- Prototype fine-tuning for domain adaptation
+## Inputs and Outputs
+**Input**:
+- Text prompt (instruction or question)
+**Output**:
+- Generated text response completing or following the input prompt
+## License
+This model is released under the **Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0)** license.
+Non-commercial use, modification, and redistribution are permitted with attribution.
+For commercial licensing, please contact **[email protected]**.