SambaNova

company

Verified

https://sambanova.ai/

SambaNovaAI

sambanova

Activity Feed

Inference Provider

VERIFIED

73,618 monthly requests

AI & ML interests

None defined yet.

Recent Activity

hongfenglu authored a paper 11 days ago

Training Domain Draft Models for Speculative Decoding: Best Practices and Insights

hongfenglu authored a paper 12 days ago

On the Tool Manipulation Capability of Open-source Large Language Models

hongfenglu authored a paper 12 days ago

Synthetic Document Question Answering in Hungarian

View all activity

Papers

On the Tool Manipulation Capability of Open-source Large Language Models

View all Papers

Articles

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 490

Amitabhab

in sambanovasystems/trip-planner 5 months ago

Interesting use case for using the crew platform

#1 opened 5 months ago by

llmchamp

CobyDAdams

in sambanovasystems/trip-planner 5 months ago

Interesting use case for using the crew platform

#1 opened 5 months ago by

llmchamp

Amitabhab

updated a Space 6 months ago

Travel Planner

🌍

Plan your itinerary with the help of AI

varunbk

updated a dataset 6 months ago

sambanovasystems/attackqa

Updated Apr 18 • 58 • 2

varunbk

published a dataset 6 months ago

sambanovasystems/attackqa

Updated Apr 18 • 58 • 2

zolicsaki

updated a Space 7 months ago

Auto Web Search

⚡

Ask questions and get answers with web search integration

bol20162021

updated a model 9 months ago

sambanovasystems/QwQ-0.5B-SFT-Draft

Text Generation • 0.5B • Updated Jan 24 • 1

kz919

updated a model 9 months ago

sambanovasystems/QwQ-0.5B-SFT-Draft

Text Generation • 0.5B • Updated Jan 24 • 1

kz919

posted an update 10 months ago

Post

1693

Mini-QwQ an edge device friendly reasoning model distilled from QwQ-32B
🤗: kz919/QwQ-0.5B-Distilled-SFT
🇬 🇬 🇺 🇫: kz919/QwQ-0.5B-Distilled-SFT-gguf
🤖: kz919/Mini-QwQ

kz919

updated a Space 11 months ago

QwQ-32B-Preview

🔍

QwQ-32B-Preview

kz919

authored a paper 11 months ago

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 20

zolicsaki

posted an update about 1 year ago

Post

1321

We’ve open-sourced an app, powered by SambaNova Cloud and Llama 405B, that intelligently detects when a web search is needed—then answers directly or with RAG.

sambanovasystems/auto-web-search

🥚 A hidden Easter egg is that Auto Search detection is already trained into Llama 3.1 checkpoints. Simply use the tool usage system prompt below, and the model will either respond with a web search query if it deems necessary or respond to the query directly.🥚

Environment: IPython
Tools: Brave Search
Knowledge Cutoff Date: December 2023
Today's Date: September 2024
You are a helpful assistant. Reminder:
Search function calls MUST follow the specified format: "brave_search.call(query)"

You can see the documentation here
https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_1#built-in-tooling
and read about how the tool usage was trained into Llama3.1 models in section 4.3.5 here https://arxiv.org/pdf/2407.21783

kz919

posted an update about 1 year ago

Post

1636

Just for the meme.

But the clear lesson I learnt from building these demos are, the more powerful the underlying base model is, the closer you will get to GPT4o1. CoT is nothing more than simply inducing the latent reasoning capability from the model.

kz919/GPT4-O1-Proximas

kz919

posted an update about 1 year ago

Post

1922

https://huggingface.co/spaces/kz919/Llama3.1-Instruct-O1

zolicsaki

posted an update about 1 year ago

Post

1349

Fast inference is no longer a nice-to-have demo; it will be the driving force behind future frontier models. Time to switch over to custom AI hardware and short Nvidia.

Try out SambaNova's lightning fast API for free at https://sambanova.ai/fast-api?api_ref=444868

kz919

posted an update about 1 year ago

Post

2463

"It's Sunday night, fancy a game?"
https://kz919-can-you-beat-405b-in-chess.hf.space/
built with the one and only SN fast API:
https://sambanova.ai/fast-api?api_ref=907266

7 replies

kz919

posted an update about 1 year ago

Post

648

Good lord... Spent almost a day debugging this and it turns out it was an issue of gradio update incompatible with the new fastapi.
https://discuss.huggingface.co/t/huggingface-space-failed-after-working-initially/105514/8

Finally got it back online! Come chat with your favorite anime characters here:
kz919/Persona-AI

kz919

posted an update about 1 year ago

Post

1608

Spent a few minutes to build an alternative to Character AI on top of llama3.1 405B through SambaNova's super fast inference API

Space: kz919/Persona-AI
API referral link: https://sambanova.ai/fast-api?api_ref=907266

3 replies

kz919

posted an update about 1 year ago

Post

1699

The only 405B spaces still freely accessible are powered by SN fast api.

xianbao/SambaNova-fast

https://sambanova.ai/fast-api?api_ref=907266

zolicsaki

posted an update about 1 year ago

Post

1818

You can run Llama405B at over 100 tokens per second for free using SambaNova's API! https://sambanova.ai/fast-api?api_ref=444868

I have been able to generate some high quality synthetic data and use it as an LLM as a judge instead of the slower and more expensive alternatives like openAI or Anthropic.

2 replies

Inference Provider

AI & ML interests

Recent Activity

Papers

Articles

Welcome to Inference Providers on the Hub 🔥

Team members 157

sambanovasystems's activity

Interesting use case for using the crew platform

Interesting use case for using the crew platform

Travel Planner

Auto Web Search

QwQ-32B-Preview