nexaml commited on
Commit
f0a99db
·
verified ·
1 Parent(s): 5197186

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - ibm-granite/granite-4.0-h-350m
4
+ ---
5
+ # Granite-4.0-h-350M
6
+
7
+ <p align="center">
8
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/6851901ea43b4824f79e27a9/vBAkkCukOQ3CHlT2GBwvI.png" width="350" height="350">
9
+ </p>
10
+
11
+ Run **Granite-4.0-h-350M** optimized for **Qualcomm Hexagon NPUs** with [NexaSDK](https://sdk.nexa.ai) on Android
12
+
13
+ ## Model Description
14
+ **Granite-4.0-h-350M** is a 350-million-parameter transformer model from IBM’s Granite 4.0 family — designed for efficient inference, low-latency edge deployment, and instruction following at compact scale.
15
+ It shares the same data quality, architecture design, and alignment pipeline as larger Granite 4.0 models but is optimized for lightweight environments where performance per watt and model size are critical.
16
+
17
+ Built on the **Granite 4.0** foundation, this model continues IBM’s commitment to open, responsible AI, offering transparency and adaptability for developers, researchers, and embedded AI applications.
18
+
19
+ ## Features
20
+ - **Compact yet capable**: Delivers high-quality generation and reasoning with just 350M parameters.
21
+ - **Instruction-tuned**: Follows natural language instructions for diverse tasks.
22
+ - **Low-latency performance**: Ideal for CPU, GPU, and NPU inference.
23
+ - **Efficient deployment**: Runs smoothly on edge and resource-constrained devices.
24
+ - **Open and transparent**: Released under IBM’s open model governance framework.
25
+
26
+ ## Use Cases
27
+ - On-device assistants and chatbots
28
+ - Edge AI and IoT inference
29
+ - Document and text summarization
30
+ - Education and lightweight reasoning tasks
31
+ - Prototype fine-tuning for domain adaptation
32
+
33
+ ## Inputs and Outputs
34
+ **Input**:
35
+ - Text prompt (instruction or question)
36
+
37
+ **Output**:
38
+ - Generated text response completing or following the input prompt
39
+
40
+ ## License
41
+ This model is released under the **Creative Commons Attribution–NonCommercial 4.0 (CC BY-NC 4.0)** license.
42
+ Non-commercial use, modification, and redistribution are permitted with attribution.
43
+ For commercial licensing, please contact **[email protected]**.