Skip to main content
To use HuggingFace Inference APIs with Hymalaia, follow the instructions below.

๐Ÿงพ Prerequisites

You must have a Pro Account with HuggingFace to obtain an API key.
โš ๏ธ Note: As of November 2023, HuggingFace no longer supports very large models (over 10GB) like LLaMA-2-70B on the Pro Plan. Youโ€™ll need to:
  • Use a dedicated Inference Endpoint (paid)
  • Or subscribe to an Enterprise Plan
The Pro Plan still works with smaller models, but these may yield suboptimal results for Hymalaia.

๐Ÿ”‘ Get Your Access Token

  1. Go to your HuggingFace user settings.
  2. Copy your User Access Token (HFAccessToken).

โš™๏ธ Set Up Hymalaia with HuggingFace

Refer to your deployment-specific documentation for setting environment variables.

๐Ÿง  Using LLaMA-2-70B via Inference API

To configure Hymalaia for next-token generation using HuggingFaceโ€™s Inference API:
  1. Navigate to the LLM page in the Hymalaia Admin Panel.
  2. Add a Custom LLM Provider with the following identifiers:
HFCustomLLMProvider1
HFCustomLLMProvider2
These custom providers allow Hymalaia to route prompt completion requests to the HuggingFace-hosted model endpoint.
For more detailed setup and environment configuration examples, refer to the Model Configs.