Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,43 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model:
|
| 3 |
+
- ibm-granite/granite-4.0-h-350m
|
| 4 |
+
---
|
| 5 |
+
# Granite-4.0-h-350M
|
| 6 |
+
|
| 7 |
+
<p align="center">
|
| 8 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/6851901ea43b4824f79e27a9/vBAkkCukOQ3CHlT2GBwvI.png" width="350" height="350">
|
| 9 |
+
</p>
|
| 10 |
+
|
| 11 |
+
Run **Granite-4.0-h-350M** optimized for **Qualcomm Hexagon NPUs** with [NexaSDK](https://sdk.nexa.ai) on Android
|
| 12 |
+
|
| 13 |
+
## Model Description
|
| 14 |
+
**Granite-4.0-h-350M** is a 350-million-parameter transformer model from IBM’s Granite 4.0 family — designed for efficient inference, low-latency edge deployment, and instruction following at compact scale.
|
| 15 |
+
It shares the same data quality, architecture design, and alignment pipeline as larger Granite 4.0 models but is optimized for lightweight environments where performance per watt and model size are critical.
|
| 16 |
+
|
| 17 |
+
Built on the **Granite 4.0** foundation, this model continues IBM’s commitment to open, responsible AI, offering transparency and adaptability for developers, researchers, and embedded AI applications.
|
| 18 |
+
|
| 19 |
+
## Features
|
| 20 |
+
- **Compact yet capable**: Delivers high-quality generation and reasoning with just 350M parameters.
|
| 21 |
+
- **Instruction-tuned**: Follows natural language instructions for diverse tasks.
|
| 22 |
+
- **Low-latency performance**: Ideal for CPU, GPU, and NPU inference.
|
| 23 |
+
- **Efficient deployment**: Runs smoothly on edge and resource-constrained devices.
|
| 24 |
+
- **Open and transparent**: Released under IBM’s open model governance framework.
|
| 25 |
+
|
| 26 |
+
## Use Cases
|
| 27 |
+
- On-device assistants and chatbots
|
| 28 |
+
- Edge AI and IoT inference
|
| 29 |
+
- Document and text summarization
|
| 30 |
+
- Education and lightweight reasoning tasks
|
| 31 |
+
- Prototype fine-tuning for domain adaptation
|
| 32 |
+
|
| 33 |
+
## Inputs and Outputs
|
| 34 |
+
**Input**:
|
| 35 |
+
- Text prompt (instruction or question)
|
| 36 |
+
|
| 37 |
+
**Output**:
|
| 38 |
+
- Generated text response completing or following the input prompt
|
| 39 |
+
|
| 40 |
+
## License
|
| 41 |
+
This model is released under the **Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0)** license.
|
| 42 |
+
Non-commercial use, modification, and redistribution are permitted with attribution.
|
| 43 |
+
For commercial licensing, please contact **[email protected]**.
|