Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
AI & ML interests: Language Model, Diffusion Language Model
models (23)
JetLM/SDAR-30B-A3B-Sci-Base • Text Generation • 31B • Updated • 11
JetLM/SDAR-30B-A3B-Sci • Text Generation • 31B • Updated • 35
JetLM/SDAR-30B-A3B-Chat • Text Generation • 31B • Updated • 65 • 2
JetLM/SDAR-8B-Chat • Text Generation • 8B • Updated • 174 • 2
JetLM/SDAR-4B-Chat • Text Generation • 4B • Updated • 3.27k • 2
JetLM/SDAR-1.7B-Chat • Text Generation • 2B • Updated • 1.35k • 7
JetLM/SDAR-30B-A3B-Chat-b8 • Text Generation • 31B • Updated • 13
JetLM/SDAR-30B-A3B-Chat-b64 • Text Generation • 31B • Updated • 14
JetLM/SDAR-30B-A3B-Chat-b16 • Text Generation • 31B • Updated • 16
JetLM/SDAR-30B-A3B-Chat-b32 • Text Generation • 31B • Updated • 12
datasets (0)
None public yet