Best of the week

models-page.best-models.alt-title

Claude Sonnet 4
Claude Sonnet 4
Claude Sonnet 4 significantly surpasses Sonnet 3.7, showing excellent results in programming and reasoning with high accuracy and controllability. It achieves record performance on the SWE benchmark (72.7%) and is ideal for a wide range of applications, from routine tasks to complex software development projects
o4 Mini High
o4 Mini High
The o4-mini model with a high level of reasoning_effort for thorough reasoning. Combines speed and multimodality with accuracy in STEM and visual tasks in a context of 200K tokens.
Veo 3
Veo 3
Veo 3 is an advanced AI video-generation model designed to support filmmakers and storytellers.
GPT-5
GPT-5
GPT-5 is an advanced OpenAI model with improved reasoning, accuracy, and code quality. It is optimized for complex tasks that require step-by-step thinking and following instructions. It features reduced error rates and enhanced efficiency in programming, writing, and health-related tasks.
Flux-1.1 Pro Ultra
Flux-1.1 Pro Ultra
Enhanced version of the image generation model with support for 4 times higher resolution (up to 4 MP), maintaining a generation speed of 10 seconds per image. The model offers a 'raw mode' for creating more natural images.
Gemini-2.5 Pro Preview
Gemini-2.5 Pro Preview
Google's model capable of 'thinking' before answering for greater accuracy and performance. Leader on the LMArena platform with advanced capabilities in reasoning, coding, and multimodality (text, audio, images, video).
models-page.best-models.link

Available neural network models

Tariff
ELITE
Product
APIDashboard
Currency
USDCAPS
Cost in dollars
ModelContext size (in tokens)Output size (in tokens)Prompt (per 1M tokens)Image prompt (per 1k tokens)Response (per 1M tokens)
400 000128 0001,97015,75
400 000128 0001,41011,25
400 000128 00016,880135
1 047 57632 7682,2509
400 000128 0000,2802,25
4 095100 0002,2509
400 000128 0001,41011,25
400 000128 00011,25011,25
* Our markup on these prices is 5%, which is included in the cost of packages except Basic (Premium and higher)

LLM Request

Cost of a single request in the dashboard
All tariffs
Used tokens + 0.01 $per 1 request
Special attention: The use of Easy Writer is charged differently. For each text generation, Easy Writer charges an additional 0.1 $ per request + the token cost as specified above for a regular LLM request.

Image Generation

Cost of a single generation by models
MidJourney — Relax
26 000 CAPS / 0,04 $ For 1 generation
MidJourney — Fast
52 000 CAPS / 0,08 $ For 1 generation
MidJourney — Turbo
104 000 CAPS / 0,16 $ For 1 generation
Dall-E
33 000 CAPS / 0,05 $ For 1 generation
Flux
40 000 CAPS / 0,06 $ For 1 generation
Stable Diffusion
26 250 CAPS / 0,04 $ For 1 generation

Web Search

Cost of a single web search usage
All tariffs
Used tokens + 0.01 $per 1 request
Link Analysis
0,01 Capsfor 1 character

Video Generation

Cost of creating one second of video
GoogleVeo
450 000 Caps / 0.68 $per 1 second
Runway
30 000 Caps / 0.04 $per 1 second
For video generation in 1080p quality using veo-3, an additional charge of +20% is added

Speech Synthesis

Cost of one speech synthesis
TTS
11 250 Caps / 0.02 $per 1 000 characters
TTS HD
27 225 Caps / 0.04 $per 1 000 characters

Transcription

The cost of one transcription
AssemblyAI — nano
2 000 Caps / 0.003 $Per 1 minute
AssemblyAI — best
5 500 Caps / 0.008 $Per 1 minute
A fixed surcharge on all requests: $0.05 per request, $0.10 for files over 50 MB, $0.50 for files over 500 MB

Embeddings

Model embeddings available through our API.
Cost in CapsCost in dollars
ModelEmbedding dimensionPrompt cost (per 1 token)Prompt cost (per 100,000 tokens)
text-embedding-3-largeThe most efficient embedding model
3 0720,120,16
text-embedding-3-smallIncreased performance compared to the 2nd generation ada embedding model
1 5360,020,02
text-embedding-ada-002The most powerful 2nd generation embedding model, replacing 16 first generation models
1 5360,090,12
text-embedding-3-largeThe most efficient embedding model
3 072Embedding dimension
0,12Prompt cost (per 100,000 tokens)
0,16Prompt cost (per 100,000 tokens)
text-embedding-3-smallIncreased performance compared to the 2nd generation ada embedding model
1 536Embedding dimension
0,02Prompt cost (per 100,000 tokens)
0,02Prompt cost (per 100,000 tokens)
text-embedding-ada-002The most powerful 2nd generation embedding model, replacing 16 first generation models
1 536Embedding dimension
0,09Prompt cost (per 100,000 tokens)
0,12Prompt cost (per 100,000 tokens)

What are Caps?

Caps is the internal currency of the service, used to measure the cost of requests and responses of neural networks. It is fixed and depends on the model complexity: number of parameters, multimodality, and overall power.

    For example:
  • ChatGPT-3.5 — ~1 Caps per token
  • ChatGPT o1-Pro — ~400+ Caps per token
The higher your tariff, the better the price: 1 million Caps is cheaper on Elite than on Basic.

Still have questions?

What are tokens?

Tokens are units of text processing by the neural network, representing parts of words, entire words, or punctuation marks that determine the cost of requests.

How long will 1 million tokens last?

One million tokens of the GPT-4o model are enough to rewrite “The Brothers Karamazov” by F. M. Dostoevsky.

What to do if I run out of tokens?

Purchase additional Caps in your personal account — https://bothub.chat/profile

Why does the neural network pretend to be another?

The neural network does not know what model it is if it is not specified in the system prompt. The “self-identification” of the model without instruction is influenced by many factors, one of them being the model's data training set.

What is context in a neural network?

Context is the amount of information that the neural network retains in memory during a dialogue, affecting the coherence of responses and understanding of previous requests.

What is the context of different neural network models?

GPT o1 Pro and Claude 3.7 Sonnet support up to 200K tokens, Gemini 2.5 Pro works with 1KK, while Gemini 2.0 Pro supports up to 2KK tokens.

What file formats do models read?

Neural networks process TXT, PDF, DOCX, XLSX, CSV, JSON, XML, HTML, as well as images JPG, PNG, and audio files MP3, MP4.

Can neural networks be used for free?

There are free models with the postfix “:free” and “-exp” that can be used for free through a mini-window on the main page, as well as the model page.

How do neural network models differ from each other?

Models differ in the volume of training data, context size, processing speed, specialization in specific tasks, and ability to work with multimodal content.

How to use models via API?

To integrate models into your applications, you need to obtain an API key in your personal account. More details can be found here: https://bothub.chat/api/documentation/ru.

Can neural networks be used to automate business processes?

Neural networks effectively automate routine tasks of document management, data processing, customer support, and analytics, integrating with existing business systems via API.

Support ServiceOpen from 07:00 AM to 12:00 PM
Available neural network models :: BotHub