Spaces:

huggingface
/

HuggingDiscussions

Running

App Files Files Community

[FEEDBACK] Inference Providers

#49

by julien-c - opened Jan 17

Discussion

julien-c

Hugging Face org Jan 17

Any inference provider you love, and that you'd like to be able to access directly from the Hub?

reach-vb

Hugging Face org Jan 28

•

edited Jan 28

Love that I can call DeepSeek R1 directly from the Hub 🔥

from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="together",
    api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)

messages = [
    {
        "role": "user",
        "content": "What is the capital of France?"
    }
]

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1", 
    messages=messages, 
    max_tokens=500
)

print(completion.choices[0].message)

benhaotang

Jan 28

•

edited Jan 28

Is it possible to set a monthly payment budget or rate limits for all the external providers? I don't see such options in billings tab. In case a key is or session token is stolen, it can be quite dangerous to my thin wallet:(

julien-c

Hugging Face org Jan 28

@benhaotang you already get spending notifications when crossing important thresholds ($10, $100, $1,000) but we'll add spending limits in the future

benhaotang

Jan 28

•

edited Jan 28

@benhaotang you already get spending notifications when crossing important thresholds ($10, $100, $1,000) but we'll add spending limits in the future

Thanks for your quick reply, good to know!

sylanaustin

Jan 28

Would be great if you could add Nebius AI Studio to the list :) New inference provider on the market, with the absolute cheapest prices and the highest rate limits...

Hazzzardous

Jan 28

Could be good to add featherless.ai

teentitan

Jan 28

TitanML !!

122 hidden messages

Expand all

huonghtl5

Jun 16

Please add FPT AI Inference http://marketplace.fptcloud.com/
It's a newcomer but already has hundreds of users, thanks to its speed, stability, and competitive price.

Moibe

Jun 16

black-forest-labs/FLUX.1-schnell is not properly working under HFInference provider, is this going to be permanent or there is an issue?

Moibe

Jun 17

Actually it seems that HFInference is not working at all, do we need to use models now only via external providers????

Moibe

Jun 17

The message for any HFInference is: "Our latest automated health check on this model for this provider did not complete successfully." Is this temporary or HFInference won't process certain models anymore, or maybe it is a bug?

deleted

Jun 20

Hi Team,

I want to register as a Inference Providers, Can you please suggest us the way forward process.

Thanks
Cyfuture

deleted

about 1 month ago

Dear Hugging Face Team,

Greetings from Cyfuture AI!

We are reaching out to explore a potential collaboration with Hugging Face. As a rapidly growing enterprise-grade AI solutions provider, Cyfuture AI offers robust and scalable inference capabilities powered by high-performance GPU infrastructure.
We would be keen to join Hugging Face as an official inference provider to support model deployment and inference workloads for your global community. We believe this integration would bring mutual value—enhancing access to affordable, high-speed inference while expanding our reach within the AI ecosystem.
Please let us know the next steps or any prerequisites required to move forward with this partnership.
Looking forward to your response.
Regards,

Cyfuture.ai
Email us at - [email protected]

yijiehong

21 days ago

Hi Hugging Face Team,

We are from GmiCloud (https://inference-engine.gmicloud.ai). We want to be an inference provider on Hugging Face. At gmicloud, we focus on LLM inference optimizations. We started to follow the instruction at https://huggingface.co/docs/inference-providers/register-as-a-provider#register-the-provider. While it needs to reach out first. We’d greatly appreciate any guidance or support from the community on how to move forward with becoming an official inference provider on the platform.

Thanks in advance!

Gmi Cloud AI
Email us at - [email protected]

Swarmind-ai

about 13 hours ago

Dear Hugging Face Team,

We're reaching out from Swarmind.ai, a high-performance AI infrastructure company, to express interest in becoming an official inference provider on your platform.

We offer scalable, GPU-powered inference optimized for production workloads, and believe this integration would benefit both communities.

Let us know the next steps to move forward.

Best,
Swarmind Team
[email protected]

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment