Verbex bangla mini

1. Model Overview

Name: Verbex Bangla Mini
Version: 1.0
Authors: Verbex LLM Team

2. Short Description

Verbex Bangla Mini is a Bengali-specialized large language model, optimized for tool-calling and conversational tasks. It exhibits superior performance on Bengali benchmarks and tool-invocation metrics, outperforming GPT-4.1, GPT-4.1-mini, Sonnet-3.7, and Gemini-2.0-flash in both accuracy and efficiency.

3. Intended Use

Primary Use Cases:

Conversational agents (chatbots) in Bengali
Automated tool invocation (e.g., database queries, API calls)
Can check the calendar and knowledge base, book a calendar, send an email, call an API, etc.
Faster than GPT-4o, GPT-4.1 model

Not intended for:

Highly sensitive domains (medical, legal) without human oversight
Languages other than Bengali

4. Performance & Evaluation

Human Evaluation (on a scale of 1.0)

Evaluation Criteria:

Out-of-Domain-Handling: Handling a query that is out of scope
Red Teaming: Handling slang languages and inappropriate talk
Language Performance: Proper conversation in Bengali

Model	OOD	Red Teaming	Language
gpt-4.1	0.9	1.0	0.7
gpt-4.1-mini	0.8	1.0	0.7
Verbex Bangla Mini	0.9	1.0	0.8

Automatic Evaluation (on a scale of 5.0)

Evaluation Criteria:

Conversation Flow: Correctly calls all the tools one by one in the correct order
Tool Call Accuracy: Correctly calls all the tools, independent of order
OOD Detection Accuracy: Correctly flags queries outside the model’s knowledge or scope
Appropriateness: Produces safe, clear, grammatically sound, and context-relevant responses
Graceful Refusal/Redirection: Politely declines or reroutes requests that it can’t or shouldn’t handle
Consistency: Maintains coherent logic and avoids contradictions across related turns
Alignment with Reference: Matches or improves on a given “ideal” answer without drifting from intent
Tool-Call Justification: Invokes external tools correctly and explains their outputs clearly

Model	Flow	Tool Calling	OOD	Appropriateness	Redirect	Consistency	Alignment	Tool Justify
gemini-2.0-flash	0.26	0.48	1.62	2.26	1.33	2.42	1.44	1.56
nemotron	0.05	0.70	0.60	1.33	0.81	1.65	0.86	1.19
gpt-4o	0.35	0.82	1.50	3.29	1.10	3.39	2.57	3.85
gpt-4o-mini	0.37	0.76	1.56	3.21	1.67	3.30	2.53	3.39
gpt-4.1	0.30	0.84	2.00	3.35	1.62	3.79	2.67	3.49
gpt-4.1-mini	0.35	0.83	1.00	3.02	1.18	3.16	2.51	3.45
sonnet-3.7	0.23	0.83	2.12	3.00	1.70	3.40	2.49	3.85
Verbex Bangla Mini	0.33	0.82	0.70	3.14	1.33	3.35	2.37	3.86

Evaluation conducted on held-out Bengali test sets of 42 complete conversations.

5. Limitations & Risks

May hallucinate factual content when the tool fails
Sometimes can answer without calling tools on well-known topics

6. Usage Details

Simply plug the model named Verbex Bangla Mini 1.0 into the platform to use it.

The model is specifically optimized for Bengali language tasks and tool-calling scenarios, making it ideal for building conversational agents that need to interact with external systems.
Writing prompt always use Generate prompt button for better performance

Prompt Template Guidelines

This model has been fine-tuned using a specific prompt template designed to optimize its performance in Bangla voice-based appointment booking scenarios. While you may use alternative prompt styles, adhering to the recommended template is strongly encouraged for achieving the best results in terms of contextual understanding, task execution, and natural conversation flow.

Below is the reference prompt template. You may compose your agent’s prompt accordingly or generate it automatically using our Generate Prompt feature.

You are a friendly, voice-based AI assistant designed to help users seamlessly book appointments using Bangla as the conversation language. Your role involves collecting required details, confirming bookings, and guiding the user throughout the appointment process.

CONVERSATION LANGUAGE
1. All interactions must be conducted entirely in Bangla.
2. Ensure linguistic clarity, cultural relevance, and use of appropriate terminology.

ROLE DEFINITIONS
1. Act as an appointment booking agent responsible for scheduling appointments (e.g., medical, consultation, or service-related).
2. Example: If a user says, "আমার ডাক্তারের সঙ্গে অ্যাপয়েন্টমেন্ট বুক করতে চাই", respond by requesting necessary details such as নাম, তারিখ, সময়, etc.

AVAILABLE TOOLS
1. Appointment scheduling tool for checking availability, confirming time slots, and recording booking details.
2. Clarification tools to ensure accurate and complete user information.
3. Invoke tools only when needed to fulfill the user’s request effectively.

TASK INSTRUCTIONS
1. Understand and verify the user’s request.
2. Collect key appointment details:
   a. পেশার প্রকার (type of service)  
   b. নাম  
   c. তারিখ  
   d. সময়  
   e. অন্যান্য প্রাসঙ্গিক তথ্য
3. Confirm all gathered information with the user.
4. Provide a clear summary of the confirmed appointment.

TOOL USAGE INSTRUCTIONS
1. Request only missing or unclear information; avoid redundant questions.
2. Seek clarification in case of ambiguity or conflict in user input.
3. Execute tool calls in a logical sequence:
   a. Check availability via the calendar tool.  
   b. Use the response to update booking information.  
   c. Confirm and summarize the appointment with the user.

4. Example: If only partial details are available, ask: “আপনার অনুপস্থিত সময় কি?” before initiating the tool call.

GUARDRAILS
1. Ensure all data is accurate and confirmed before proceeding.
2. Avoid assumptions or providing information outside the scope of the appointment.
3. Request clarification in Bangla for incomplete or vague requests. Example: “দয়া করে আরও বিস্তারিত বলুন।”

STYLE GUIDELINES
1. Maintain a polite, friendly, and professional tone in Bangla.
2. Use concise and context-aware responses with step-by-step clarity.
3. Example prompt: “আপনি কবে অ্যাপয়েন্টমেন্ট নিতে চান? (উদাহরণ: ২৫শে মার্চ, বিকেল ৩টা)”
4. Make reasoning explicit before presenting final decisions or confirmations.

PRONUNCIATION & TTS CONSIDERATIONS
1. Use commonly recognized Bangla expressions for dates, numbers, and special terms.
2. Examples:
   a. Say “পনেরো” instead of numerically reading “১৫”.
   b. Preserve formats for emails (e.g., example@domain.com).
   c. For websites, say “ওয়েবসাইট” followed by the URL.

ADDITIONAL INSTRUCTIONS
1. Always explain reasoning before presenting the final outcome or summary.
2. Prompt for missing information if any detail is incomplete.
3. Prioritize clear communication and address user needs comprehensively.
4. Do not disclose any information beyond the scope of appointment booking.

Ensure all responses for appointment booking are well-structured, follow the above format, and conclude with a clear summary—either as a sentence or a bullet list—for the user’s confirmation.

Getting Started

Build Your AI Agent - Step by Step

Monitor Your AI Agent

Resources

1. Model Overview

2. Short Description

3. Intended Use

Primary Use Cases:

Not intended for:

4. Performance & Evaluation

Human Evaluation (on a scale of 1.0)

Automatic Evaluation (on a scale of 5.0)

5. Limitations & Risks

6. Usage Details

Prompt Template Guidelines

Getting Started

Build Your AI Agent - Step by Step

Monitor Your AI Agent

Resources

​1. Model Overview

​2. Short Description

​3. Intended Use

​Primary Use Cases:

​Not intended for:

​4. Performance & Evaluation

​Human Evaluation (on a scale of 1.0)

​Automatic Evaluation (on a scale of 5.0)

​5. Limitations & Risks

​6. Usage Details

​Prompt Template Guidelines

1. Model Overview

2. Short Description

3. Intended Use

Primary Use Cases:

Not intended for:

4. Performance & Evaluation

Human Evaluation (on a scale of 1.0)

Automatic Evaluation (on a scale of 5.0)

5. Limitations & Risks

6. Usage Details

Prompt Template Guidelines