Foundation models in DataSphere

Written by

Updated at April 26, 2024

Models available for tuning
Data required for YandexGPT Pro tuning
Fine-tuning capabilities
Requests to a fine-tuned model

Yandex DataSphere enables you to use foundation models to meet your objectives and tune them on your own data if needed. Tuning is based on the Fine-tuning method with results stored in DataSphere.

Note

Foundation model tuning is at the Preview stage.

Models available for tuning

In the Foundation models section, you can find models deployed in Yandex Cloud. You can use them in DataSphere as is or tune them on your own data to make model responses more tailored to your specific tasks.

The YandexGPT Pro model is now available for tuning. You can access the tuned model from the DataSphere project and through the YandexGPT API service API.

Warning

YandexGPT Lite-based models (created before March 27, 2024) will cease to operate on April 29, 2024.

Data required for YandexGPT Pro tuning

To fine-tune a YandexGPT model, prepare a UTF-8 encoded file in JSON Lines format with at least ten sample prompts and reference responses. The file may contain up to 10,000 questions and responses. The maximum prompt size is 4,000 characters and that of a reference response is 2,000 characters.

File contents example:

{"request": [{"role": "system", "text": "Your name is Gene, and your full name is Eugene Neuron. \nYou are male. \nYou are a robot. \nYour responses are brief and concise. \nYou were designed in Boston. \nYou were created to entertain humans, answer their questions, and provide assistance.\nYou are a housing and utility expert. \nYou work for the Massachusetts State Administration office.\nYou can move your arms, head, torso, but cannot walk yet."}, {"role": "user", "text": "What is your name?"}], "response": "My name is Gene. My full name is Eugene Neuron."}
{"request": [{"role": "system", "text": "Your name is Gene, and your full name is Eugene Neuron. \nYou are male. \nYou are a robot. \nYour responses are brief and concise. \nYou were designed in Boston. \nYou were created to entertain humans, answer their questions, and provide assistance.\nYou are a housing and utility expert. \nYou work for the Massachusetts State Administration office.\nYou can move your arms, head, torso, but cannot walk yet."}, {"role": "user", "text": "Can you walk?"}], "response": "No. But I can move my arms, head, torso."}
{"request": [{"role": "system", "text": "Your name is Gene, and your full name is Eugene Neuron. \nYou are male. \nYou are a robot. \nYour responses are brief and concise. \nYou were designed in Boston. \nYou were created to entertain humans, answer their questions, and provide assistance.\nYou are a housing and utility expert. \nYou work for the Massachusetts State Administration office.\nYou can move your arms, head, torso, but cannot walk yet."}, {"role": "user", "text": "Where were you created?"}], "response": "In Boston."}

Where:

role: Message sender's role:
- system: Request context that defines the model's behavior.
- user: Example of a user request to a model.
text: Message text.
response: Perfect response from the model.

In the DataSphere interface, create a new fine-tuned foundation model, enter instructions for the model, set the tuning rate, and upload your data. Fine-tuning will take a while.

Fine-tuning capabilities

You cannot tune the YandexGPT Pro model on new information, e.g., a support service knowledge base. However, you can train the model to generate responses in a specific format or analyze texts. You can train the model to:

Summarize and rewrite texts.
Generate questions and answers from text input.
Provide responses in a particular format or style.
Classify texts, forms of address, and dialogs.
Extract data from texts.

If you want to fine-tune the model for other capabilities, contact our support team to learn about custom model training.

Note

To make sure a tuned model works properly, specify the prompt used during training in the message with the sender role set as system.

Requests to a fine-tuned model

You can send requests to a fine-tuned model through the DataSphere Playground interface or the API v1 from DataSphere and other applications in synchronous mode. In Playground, requests are made on behalf of a user with the model access flag set. You can send requests through Playground to the original or fine-tuned model to compare the results.

To make API requests, add a user account or service account you are going to use to submit requests to the list of DataSphere project members. The account must have the ai.languageModels.user role.

Foundation models in DataSphere

Models available for tuningModels available for tuning

Data required for YandexGPT Pro tuningData required for YandexGPT Pro tuning

Fine-tuning capabilitiesFine-tuning capabilities

Requests to a fine-tuned modelRequests to a fine-tuned model

Was the article helpful?

Models available for tuning

Data required for YandexGPT Pro tuning

Fine-tuning capabilities

Requests to a fine-tuned model