NeuroForge User Guide

Welcome to NeuroForge, the world's first AI Battle Arena: a platform designed to help you compare, analyze, and challenge the world's most advanced AI models head-to-head in real time.

Whether you're an AI researcher, prompt engineer, or just a curious mind exploring how GPT, Claude, and Gemini think differently, NeuroForge turns model comparison into a visual, interactive experience.

💡 Core Concept

NeuroForge isn't just a chatbot playground — it's a competitive benchmarking system. You send one prompt, and three major AI models respond simultaneously. Each response is automatically evaluated, scored, and ranked.

How It Works

When you enter a prompt in the input box, NeuroForge dispatches it to three AI models in parallel — typically GPT-4o, Claude Opus, and Gemini 2.5.

Each model's reply is collected and displayed side-by-side so you can immediately see how their reasoning, tone, and structure differ.
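The parallel dispatch described above can be sketched with Python's asyncio. This is only an illustration, not NeuroForge's actual backend: `ask_model` and `battle` are hypothetical names, and the sleep delays stand in for real provider calls.

```python
import asyncio
import time

# Hypothetical stand-in for a real provider call: it sleeps, then returns text.
async def ask_model(name: str, prompt: str, delay: float) -> dict:
    started = time.perf_counter()
    await asyncio.sleep(delay)  # simulates network and generation time
    elapsed_ms = (time.perf_counter() - started) * 1000
    return {"model": name, "text": f"{name} answer to: {prompt}", "ms": elapsed_ms}

async def battle(prompt: str) -> list[dict]:
    # gather() starts all three calls together and returns results in call order,
    # regardless of which model finishes first.
    return await asyncio.gather(
        ask_model("GPT", prompt, 0.03),
        ask_model("Claude", prompt, 0.01),
        ask_model("Gemini", prompt, 0.02),
    )

results = asyncio.run(battle("Explain recursion"))
for reply in results:
    print(reply["model"], round(reply["ms"]), "ms")
```

Because the calls overlap, the whole round takes roughly as long as the slowest model rather than the sum of all three.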

Automatic Scoring System

Every response is analyzed by NeuroForge's Scoring Engine, which evaluates six aspects of quality:

  • Success – Did the model produce a valid, coherent response without errors?
  • Speed – How fast did it generate its output relative to others?
  • Completeness – Did it fully address all aspects of your question?
  • Accuracy – Are the facts correct and logically consistent?
  • Readability – How natural and clear is the text to read?
  • Structure – Does it format and present the answer well?

Each metric contributes to a total score out of 100, and the top performer is displayed as the Round Winner with an explanation of why it won.
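As a rough sketch of how six metrics could combine into a score out of 100: the weights below are assumptions chosen for illustration (the real Scoring Engine's weights are not documented here), and `total_score` and `round_winner` are hypothetical helper names.

```python
# Hypothetical weights that sum to 100; the real engine may weigh metrics differently.
WEIGHTS = {
    "success": 20,
    "speed": 10,
    "completeness": 20,
    "accuracy": 25,
    "readability": 15,
    "structure": 10,
}

def total_score(metrics: dict[str, float]) -> int:
    """Combine per-metric ratings in the range 0..1 into a score out of 100."""
    return round(sum(WEIGHTS[name] * metrics[name] for name in WEIGHTS))

def round_winner(scores: dict[str, int]) -> str:
    """Return the model with the highest total score."""
    return max(scores, key=scores.get)

ratings = {
    "GPT": {"success": 1.0, "speed": 0.5, "completeness": 1.0,
            "accuracy": 1.0, "readability": 0.8, "structure": 0.7},
    "Claude": {"success": 1.0, "speed": 1.0, "completeness": 0.5,
               "accuracy": 1.0, "readability": 1.0, "structure": 0.5},
    "Gemini": {"success": 1.0, "speed": 0.0, "completeness": 1.0,
               "accuracy": 0.8, "readability": 0.6, "structure": 1.0},
}
scores = {model: total_score(m) for model, m in ratings.items()}
print(scores, "winner:", round_winner(scores))
```

With these made-up ratings, GPT totals 89, Claude 85, and Gemini 79, so GPT would be shown as the Round Winner.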

Exploring the Interface

The Dashboard Layout

When you log in to NeuroForge, you'll land in your main dashboard. At the top of the screen, you'll find the NeuroForge header bar, which shows your remaining prompts, lets you open the Settings panel, and provides Refresh and Logout options.

[Figure: Dashboard Overview]

The sidebar on the left acts as your thread manager — this is where all your conversations live. Each thread represents an ongoing "battle" or conversation between the models.

Thread Management Features:

  • Create a new thread using the + New button
  • Rename threads directly by clicking their title
  • Delete threads you no longer need
  • Revisit older threads to analyze how the models performed in the past

✨ Smart Feature

If you start typing a prompt without having a thread selected, NeuroForge will automatically create one and name it after your first input — keeping your workspace organized.

Submitting a Prompt

Let's say you create a new thread called "Debate on Quantum Consciousness". You enter your prompt:

"Explain whether quantum mechanics could play a role in human consciousness."

Hit Continue Battle, and the real magic begins.

All three AI models — GPT, Claude, and Gemini — receive your question at the same moment. They start "thinking" in parallel, and when they finish, their responses appear in neatly designed cards side-by-side.

Response Card Details:

  • Model name and chosen personality
  • Response time (in milliseconds)
  • Tokens used
  • Total word count
  • Calculated score out of 100

A golden banner appears at the top announcing the winner and summarizing what made it stand out:

🏆 Winner: GPT-4o Mini — Provided clear structure, balanced tone, and excellent factual accuracy (Score: 89/100)

Response Interactions

Each model's response isn't just static text — you can interact with it:

[Figure: Response Cards Interaction]

Hovering over a response reveals several small icons in the top-right corner of its card:

  • 🔍 Analyze – Triggers an "analysis round" where the other two models critique the selected response and propose improvements
  • ⛶ Expand – Opens the full text in a centered modal for distraction-free reading and copying
  • 📋 Copy – Instantly copies the entire response to your clipboard

Analysis & Tournament Mode

When you click Analyze, the selected model's response is critiqued by the remaining two models. They evaluate it, point out flaws, and attempt to provide a better version of the answer.

[Figure: Tournament Disqualification]

This results in one model being disqualified while the others advance. You can keep analyzing until only one model remains — the tournament winner.

⚠️ Prompt Consumption

Each step in the tournament consumes one prompt, but thanks to NeuroForge's optimization, you get a full 3-model tournament in just 3 prompt credits total, instead of 6.
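The credit arithmetic can be checked with a small simulation. This sketch assumes the analyzed model is the one disqualified, which matches the Example Workflow later in this guide but may not be how the engine always decides; `run_tournament` is a hypothetical name.

```python
def run_tournament(models: list[str], analyze_order: list[str]) -> tuple[str, int]:
    """Return (winner, credits_used) for one battle plus its analysis rounds.

    Assumption: the model whose response is analyzed gets disqualified.
    """
    credits = 1  # the initial battle prompt covers all models at once
    remaining = list(models)
    for target in analyze_order:
        if len(remaining) == 1:
            break
        remaining.remove(target)  # the critiqued response is eliminated
        credits += 1              # each Analyze action costs one prompt
    return remaining[0], credits

winner, credits = run_tournament(["GPT", "Claude", "Gemini"], ["Gemini", "Claude"])
print(winner, credits)  # GPT 3
```

One battle plus two analysis rounds gives the three credits quoted above.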

Thread History & Round Statistics

Every thread retains its full conversation history:

[Figure: Thread History]

Each round within a thread can be expanded or collapsed to review:

  • The models' responses for that round
  • The score breakdown for each AI
  • The winner banner and reasoning

📌 Note

Older rounds are fully viewable, though the Analyze button only works on the current round — ensuring you don't accidentally trigger new analyses on archived data.

Settings & Personalities

Clicking the Settings button in the top bar opens a detailed configuration panel:

[Figure: Settings Configurations]

Model Selection

Choose which version of each AI to use:

  • GPT: 4o Mini, 4o, 4.1, 4.1 mini, 5, 5 mini or 5 nano
  • Claude: Sonnet 4, Sonnet 4.5, Opus 4, Opus 4.1 or 3 Haiku
  • Gemini: 2.5 Flash or 2.5 Flash-Lite

Personality Customization

Assign different personalities to each model:

Personality    | Description
Chess Master   | Speaks in strategic metaphors
Sarcastic      | Dry humor and irony
Direct & Blunt | No fluff, just straight answers

More personalities are available in the app, as shown in the Settings screenshot, each written with efficiency and careful prompt engineering in mind.

🎭 Mix & Match

You can mix and match freely — for instance, pit a Sarcastic Claude against a Logical GPT and a Teacher-like Gemini to see wildly different tones and reasoning.

Prompt Limits & Tiers

NeuroForge uses a credit-based system where each action consumes "prompt units." Each tier of account includes a monthly quota:

Plan  | Monthly Prompts | Ideal For
Free  | 3               | Quick tests
Basic | 150             | Regular exploration
Pro   | 300             | Extended use

Prompt Consumption Rules

Prompts are consumed whenever:

  • You send a new prompt to all 3 AIs (1 prompt)
  • You trigger an Analyze action (1 prompt)

🔄 Monthly Reset

Prompts reset automatically every month on your billing or sign-up date and do not roll over.

Global Limits

To ensure stability and fairness, some system-wide constraints are applied:

Limit               | Value       | Purpose
Max Token Limit     | 2,500       | Total input + output per model
Max Response Time   | 30 seconds  | Timeout for model replies
Max Context Memory  | 4 messages  | Past messages remembered per thread
Max Thread Messages | 50          | Maximum interactions per thread
Max Input Length    | 5,000 chars | Maximum prompt size

🔧 Self-Hosted Note

If you're running a self-hosted version of NeuroForge, all these parameters can be customized or disabled entirely from the environment configuration.
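A minimal sketch of how a self-hosted deployment might read these limits from environment-style settings, falling back to the defaults in the table above. `MAX_TOTAL_TOKENS`, `MAX_CONTEXT_MESSAGES`, and `MAX_THREAD_MESSAGES` appear in the Admin Panel guide; `RESPONSE_TIMEOUT_SECONDS` and `MAX_INPUT_CHARS` are hypothetical names used only for illustration, as are `Limits` and `load_limits`.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Limits:
    # Defaults mirror the Global Limits table.
    max_total_tokens: int = 2500
    response_timeout_s: int = 30
    max_context_messages: int = 4
    max_thread_messages: int = 50
    max_input_chars: int = 5000

def load_limits(env: dict[str, str]) -> Limits:
    """Build Limits from environment-style variables, using defaults when unset."""
    def read(name: str, default: int) -> int:
        raw = env.get(name, "").strip()
        return int(raw) if raw else default

    return Limits(
        max_total_tokens=read("MAX_TOTAL_TOKENS", 2500),
        response_timeout_s=read("RESPONSE_TIMEOUT_SECONDS", 30),  # hypothetical name
        max_context_messages=read("MAX_CONTEXT_MESSAGES", 4),
        max_thread_messages=read("MAX_THREAD_MESSAGES", 50),
        max_input_chars=read("MAX_INPUT_CHARS", 5000),  # hypothetical name
    )

limits = load_limits({"MAX_CONTEXT_MESSAGES": "8"})
print(limits.max_context_messages, limits.max_input_chars)  # 8 5000
```

In a real process you could pass `dict(os.environ)` instead of the literal dictionary used here.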

Error Handling

NeuroForge handles errors gracefully. You'll always receive a clear message about what went wrong — no cryptic API dumps or console gibberish.

Common Errors:

  • Input too long: An error message appears instantly, and your prompt isn't consumed
  • Model timeout: "⚠️ Model exceeded maximum response length or time limit"
  • Prompt balance depleted: You are prevented from submitting new prompts until your balance is restored

✅ No Wasted Prompts

If an error occurs before models respond, your prompt credit is not consumed.
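One way to get this behavior is to charge the credit only after the models have answered. The sketch below is an assumption about the flow, not the actual implementation; it treats a timeout as an error that happens before the models respond, and `Account`, `submit`, `fast`, and `slow` are hypothetical names.

```python
import asyncio

MAX_INPUT_CHARS = 5000  # from the Global Limits table
TIMEOUT_SECONDS = 30    # from the Global Limits table

class Account:
    def __init__(self, remaining: int) -> None:
        self.remaining = remaining

async def submit(account: Account, prompt: str, call_models,
                 timeout: float = TIMEOUT_SECONDS) -> str:
    """Run one battle and charge a credit only if the models respond in time."""
    if account.remaining <= 0:
        return "error: prompt balance depleted"
    if len(prompt) > MAX_INPUT_CHARS:
        return "error: input too long"   # rejected before any model call
    try:
        await asyncio.wait_for(call_models(prompt), timeout)
    except asyncio.TimeoutError:
        return "error: model timeout"    # no credit charged
    account.remaining -= 1               # charge only after a successful round
    return "ok"

async def fast(prompt: str) -> str:
    return "reply"

async def slow(prompt: str) -> str:
    await asyncio.sleep(1)
    return "late reply"

account = Account(remaining=2)
outcomes = [
    asyncio.run(submit(account, "x" * 5001, fast)),
    asyncio.run(submit(account, "hello", slow, timeout=0.01)),
    asyncio.run(submit(account, "hello", fast)),
]
print(outcomes, account.remaining)
```

Only the third call succeeds, so the balance drops from 2 to 1.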

Security & Authentication

Access to NeuroForge is managed through secure JWT-based tokens. Each token carries your usage limits and permissions.

  • Tokens can expire or be reset by administrators
  • HTTPS encryption for all traffic in transit
  • Input sanitization to prevent injection attacks
  • Strict separation between user and admin panels

Example Workflow

Let's walk through a complete example:

  1. You log in with your token and see your Welcome Dashboard, confirming your current prompt balance
  2. You create a thread titled "Explain relativity like a pirate"
  3. You type your prompt and click Start/Continue Battle
  4. Within seconds, GPT, Claude, and Gemini all respond — one serious, one comedic, one overly poetic
  5. NeuroForge scores each and declares a winner
  6. You click Analyze on Gemini's overly poetic response — Claude and GPT critique it mercilessly, disqualifying Gemini
  7. You analyze one more time, this time on Claude's response; Claude is disqualified and GPT emerges as the final winner
  8. Three prompt credits consumed, dozens of insights gained

🎯 That's NeuroForge!

An AI experiment turned into a sport.

For Developers & Self-Hosting

If you deploy NeuroForge yourself, you can:

  • Host the backend on Node.js/Express with MongoDB
  • Build the frontend with React
  • Control rate limits, scoring weights, and model endpoints through environment variables
  • Integrate your own OpenAI, Anthropic, or Google API keys

📚 Self-Hosted Documentation

For more information, see the Self-Hosting documentation by switching the tab at the top of this page to "Admin Panel (self-hosted)".

Conclusion

NeuroForge isn't just a testing tool — it's a lens into the behavior of modern language models. It helps you see how models reason differently, where they excel, and where they fail — and it makes that analysis visual, measurable, and even fun.

So go ahead — create your first thread, send your first battle prompt, and watch the AIs fight for your approval.

NeuroForge Admin Panel Guide

Self-Hosted Edition

This manual walks you through everything an administrator can do in the NeuroForge Admin Panel when running the self-hosted edition: creating access tokens, managing users in bulk, tuning global limits, reading analytics, auditing activity, and watching server health.

⚠️ Important

The Admin Panel is a control cockpit for access & usage. It does not send prompts itself; it configures and observes the environment the NeuroForge App runs in.

Generate Token (Manual User Creation)

[Figure: Generate Token (Manual User Creation)]

The token generation system issues signed access tokens carrying the username, max prompts, and optional expiration.

Form Fields

Field       | Description
Username    | The user's identifier (email or handle)
Max Prompts | Total monthly allowance
Expiration  | ISO date (optional); the token stops working after this date
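To make "signed access token" concrete, here is a minimal HS256 JWT sketch built only on the Python standard library. It is an illustration, not NeuroForge's token code: the claim names (`sub`, `maxPrompts`), the secret, and the function names are all assumptions, and a production system would normally use a maintained JWT library.

```python
import base64
import hashlib
import hmac
import json
import time
from typing import Optional

def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWTs require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")

def _sign(secret: bytes, signing_input: str) -> str:
    digest = hmac.new(secret, signing_input.encode("utf-8"), hashlib.sha256).digest()
    return _b64url(digest)

def issue_token(secret: bytes, username: str, max_prompts: int,
                expires_at: Optional[int] = None) -> str:
    """Create an HS256-signed JWT; the claim names are illustrative."""
    header = {"alg": "HS256", "typ": "JWT"}
    payload = {"sub": username, "maxPrompts": max_prompts}
    if expires_at is not None:
        payload["exp"] = expires_at  # Unix timestamp, the standard expiry claim
    parts = [_b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
             for part in (header, payload)]
    signing_input = ".".join(parts)
    return f"{signing_input}.{_sign(secret, signing_input)}"

def verify_token(secret: bytes, token: str,
                 now: Optional[int] = None) -> Optional[dict]:
    """Return the payload if the signature is valid and the token is not expired."""
    pieces = token.split(".")
    if len(pieces) != 3:
        return None
    head, body, signature = pieces
    expected = _sign(secret, f"{head}.{body}")
    if not hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8")):
        return None
    padded = body + "=" * (-len(body) % 4)
    payload = json.loads(base64.urlsafe_b64decode(padded))
    current = int(time.time()) if now is None else now
    if "exp" in payload and current >= payload["exp"]:
        return None  # expired tokens are rejected
    return payload

secret = b"demo-secret-change-me"  # hypothetical signing secret
token = issue_token(secret, "jane@acme.com", 150, expires_at=2_000_000_000)
print(verify_token(secret, token, now=1_900_000_000))  # payload dictionary
print(verify_token(secret, token, now=2_100_000_000))  # None (expired)
```

Because the signature covers the payload, a user cannot raise their own `maxPrompts` without invalidating the token.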

After Clicking "Generate Token"

  • A brand-new token is returned and shown below the form
  • Click Copy and hand it to the user (DM, email, etc.)

🔒 Security Note

If the user loses their token later, you'll need to generate a new one. For security, tokens are not stored for future viewing in the admin panel.

Good to Know

  • You can create multiple tokens for the same username if needed (e.g., rolling keys)
  • Expired tokens are rejected automatically
  • New users become visible in the admin panel only after they log in for the first time with their generated token; that first login also activates their account

Manage Users

[Figure: Manage Users Overview]

This is your control room for individual accounts and bulk operations.

User Statistics Explained

Metric    | Description
Used      | Prompts consumed so far (this billing cycle)
Remaining | maxPrompts - used (floors at 0)
Total     | The current maxPrompts assigned
Usage %   | round((used / maxPrompts) * 100)

User Statuses

Status   | Color     | Meaning
Active   | 🟢 Green  | Can still prompt; not near depletion
Low      | 🟡 Yellow | Over 80% consumed; consider topping up
Depleted | 🔴 Red    | 100% used OR deliberately terminated
Removed  | ⚫ Gray   | Permanently disabled; cannot be re-enabled

📌 Note

"Depleted" includes Terminated users because termination sets maxPrompts = 0, making usage effectively 100%.
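The statistics and status rules above can be expressed in a few lines. This is a sketch under stated assumptions: a zero allowance is reported as 100% used (to match the note on terminated users), and `user_stats` is a hypothetical helper, not a function of the Admin Panel.

```python
def user_stats(used: int, max_prompts: int, removed: bool = False) -> dict:
    """Compute remaining, usage % and status from the documented formulas."""
    remaining = max(max_prompts - used, 0)  # floors at 0
    # Assumption: maxPrompts = 0 (a terminated user) counts as 100% used.
    usage_pct = 100 if max_prompts == 0 else round(used / max_prompts * 100)
    if removed:
        status = "Removed"
    elif usage_pct >= 100:
        status = "Depleted"
    elif usage_pct > 80:
        status = "Low"
    else:
        status = "Active"
    return {"used": used, "remaining": remaining, "total": max_prompts,
            "usage_pct": usage_pct, "status": status}

print(user_stats(5, 20))   # 25% used, Active
print(user_stats(9, 10))   # 90% used, Low
print(user_stats(0, 0))    # terminated, shown as Depleted
```

Note that exactly 80% still counts as Active here, because the table says "over 80%".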

User Actions

Reset Usage

Sets used → 0, remaining → total, usage% → 0%

Example: was 10/10 used → becomes 0/10 used

Update Prompts

Changes total (maxPrompts) without altering used.

Example: user had used=5, total=10 
→ update to total=20 
→ now used=5, remaining=15, usage=25%

Terminate

Immediately blocks access: sets maxPrompts = 0, used = 0, status appears as Depleted.

🔄 Reversible

To restore, use Update Prompts to a value > 0 (Reset Usage alone won't help because total=0).

Remove

Permanent ban: status becomes Removed, maxPrompts=0. This prevents the account from being re-created by any existing/old token.

⚠️ Permanent Action

You can hide removed users via filters, but they cannot be reactivated.
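The four actions differ only in which fields they touch, which a small model makes easy to see. The `User` class and its method names below are hypothetical and simply mirror the descriptions above.

```python
from dataclasses import dataclass

@dataclass
class User:
    name: str
    used: int
    max_prompts: int
    removed: bool = False

    def reset_usage(self) -> None:
        self.used = 0                      # total stays unchanged

    def update_prompts(self, new_total: int) -> None:
        if self.removed:
            raise ValueError("removed users cannot be reactivated")
        self.max_prompts = new_total       # used stays unchanged

    def terminate(self) -> None:
        self.max_prompts = 0               # blocks access immediately
        self.used = 0

    def remove(self) -> None:
        self.terminate()
        self.removed = True                # permanent

    @property
    def can_prompt(self) -> bool:
        return not self.removed and self.used < self.max_prompts

alice = User("alice", used=5, max_prompts=10)
alice.update_prompts(20)
print(alice.max_prompts - alice.used)  # 15

alice.terminate()
alice.reset_usage()
print(alice.can_prompt)  # False: total is still 0
alice.update_prompts(10)
print(alice.can_prompt)  # True: Terminate was reversed
```

Calling `remove()` afterwards makes `update_prompts` raise, reflecting that Remove is final.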

Filters, Views, and Utilities

  • Search by username
  • Filters: status toggles (All, Active, Low, Depleted, Removed) and usage % ranges
  • Card/Table view toggle
  • Refresh list
  • Export CSV: downloads the currently filtered list

CSV Export Columns:

User, Prompts Used, Prompts Remaining, Total Prompts, % Usage, Status

Import CSV (Bulk User Creation)

  1. Click Import CSV
  2. Download the Example CSV to see the required format:

    user,maxPrompts,expires
    jane@acme.com,150,2025-12-31
    dev-team,300,

     expires is optional (leave empty for no expiry)

  3. Upload your file. You'll see:
    • A progress bar
    • Validation errors (e.g., "row 3: invalid date format")
    • On success, a result table with a new Token column for each user
  4. Click Download updated CSV with tokens to save the full list

🔒 Security Note

Tokens shown here are one-time view. Close the popup without downloading and you won't be able to retrieve the same tokens again.
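If you want to check a file before uploading it, a pre-flight validator along these lines may help. It is a sketch of the documented format only; the panel's own checks may differ. It assumes row numbers count file lines with the header as row 1, and `validate_import` is a hypothetical name.

```python
import csv
import io
from datetime import date

REQUIRED_HEADERS = ["user", "maxPrompts", "expires"]

def validate_import(csv_text: str) -> tuple[list[dict], list[str]]:
    """Return (valid_rows, errors) for CSV text in the documented format."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if reader.fieldnames != REQUIRED_HEADERS:
        return [], [f"header: expected {','.join(REQUIRED_HEADERS)}"]
    rows, errors = [], []
    for line_no, row in enumerate(reader, start=2):  # line 1 is the header
        user = (row["user"] or "").strip()
        if not user:
            errors.append(f"row {line_no}: missing user")
            continue
        try:
            max_prompts = int((row["maxPrompts"] or "").strip())
        except ValueError:
            max_prompts = -1
        if max_prompts < 0:
            errors.append(f"row {line_no}: maxPrompts must be a non-negative integer")
            continue
        expires_raw = (row["expires"] or "").strip()
        expires = None  # empty means no expiry
        if expires_raw:
            try:
                expires = date.fromisoformat(expires_raw)
            except ValueError:
                errors.append(f"row {line_no}: invalid date format")
                continue
        rows.append({"user": user, "maxPrompts": max_prompts, "expires": expires})
    return rows, errors

sample = (
    "user,maxPrompts,expires\n"
    "jane@acme.com,150,2025-12-31\n"
    "dev-team,300,\n"
    "bob,50,31/12/2025\n"
)
valid, problems = validate_import(sample)
print(len(valid), problems)
```

Here the first two data rows pass and the third is reported as an invalid date on row 4.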

Global Limits

[Figure: Global Limits Overview]

These are platform-wide guardrails that apply to all users and all models. Saving a change triggers a quick backend restart so the new values take effect.

Token Limits

Setting                  | Description
MAX_TOTAL_TOKENS         | Ceiling for input + output tokens per response
MAX_OUTPUT_TOKENS        | Hard cap for model output length (subset of total)
GEMINI_OUTPUT_TOKENS     | Per-model output cap for Gemini (Gemini tends to be verbose)
GEMINI_OUTPUT_ACCEPTABLE | Tolerance limit for Gemini output (Gemini often ignores the requested limit)

History & Context Limits

Setting              | Description
MAX_CONTEXT_MESSAGES | How many prior messages are carried forward per thread
MAX_MESSAGE_CHARS    | Max chars per past message kept in context
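The two context settings combine as a simple window plus truncation. The sketch below shows one plausible reading of them; `build_context` is a hypothetical helper, the default of 4 messages comes from the user guide's limits table, and the 500-character default is an assumption because the real default is not stated.

```python
def build_context(history: list[str], max_messages: int = 4,
                  max_chars: int = 500) -> list[str]:
    """Keep the newest max_messages messages, each cut to max_chars characters."""
    # Guard against max_messages == 0, since history[-0:] would keep everything.
    recent = history[-max_messages:] if max_messages > 0 else []
    return [message[:max_chars] for message in recent]

history = [f"message {n}: " + "x" * 20 for n in range(1, 7)]
context = build_context(history, max_messages=4, max_chars=12)
print(context)
```

With six stored messages, only messages 3 to 6 are carried forward, each shortened to its first 12 characters.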

Thread Limits

Setting             | Description
MAX_THREAD_MESSAGES | Soft cap for thread length (users should start a new thread after reaching this)

⚠️ Admin Note

Defaults are production-safe. Raise gradually and monitor Server Monitor and Analytics for impact. Saving limits triggers a fast backend restart (seconds).

Analytics

[Figure: Analytics Overview]

The Analytics tab gives you a visual pulse of the system:

Available Charts

  • KPI Cards: Total users, active users, removed, depleted/terminated
  • Status Distribution (pie): How your user base segments across statuses
  • Top 10 by Prompts Used (bar): Your highest-volume accounts
  • New Users per Day (line): Last 14 days growth curve
  • Total Prompts: Used vs Remaining (bars)

💡 Tuning Tip

Watch the "Top 10" and "Used vs Remaining" alongside Global Limits—they'll tell you when to raise ceilings or add capacity.

Export Analytics CSV

One click to snapshot your current view for reporting.

System Logs (Audit Trail)

[Figure: System Logs Overview]

Everything important gets recorded here. You'll see:

  • Admin actions: generate token, update prompts, reset usage, terminate/remove user, clear logs, import/export CSV, etc.
  • System events: logins/logouts, backend restarts (from limit changes), rate-limit denials
  • User signals: notable user-level system events (if enabled)
  • Error events: parse failures, bad CSV rows, invalid tokens, etc.

Log Features

  • Search by text
  • Filter by type (All/Admin/System/User/Error)
  • Export current logs to JSON
  • Clear Logs (with confirmation)

Example Log Entries:

SYSTEM / Login – "Admin logged in successfully"
ADMIN / Update Prompts – "User: alice, New Prompts: 200"
ERROR / Import CSV – "Row 4: unknown column 'maxPromptz'"

Server Monitor

[Figure: Server Monitor Overview]

A clean, real-time overview for both processes:

Admin Panel Metrics (Port 3002)

  • CPU Load – % usage over the last sample window
  • Memory – Current RSS in MB
  • MongoDB – Connection status + pool count
  • API Speed – Rolling average response time
  • Sessions – Active admin sessions
  • Uptime – Since process start
  • Node / Platform / CPU cores – Environment basics

Main Backend Metrics (Port 3001)

  • The same kinds of metrics as the Admin Panel
  • Also shows Active Threads (open user conversations)
  • Shows Total Users count

Live offline detection

If your backend is down, it is flagged here as system degraded and shown as offline.

[Figure: Backend Offline Representation]

🔄 Auto-Refresh

Enabled by default (every 5 seconds). Use Refresh Now for a manual pulse.

💡 Capacity Tip

If API Speed rises and CPU/Memory aren't pegged, you're likely I/O constrained (e.g., network to model providers). If CPU/Memory peg, scale vertically or horizontally.

Limits, Billing, and Prompt Consumption

How It Really Works

  • One end-to-end battle prompt (all three models respond) consumes 1 prompt total
  • Tournament/Analysis stages consume 1 prompt per analysis round
  • The full prompt→analysis→final flow costs 3 prompts, not 6

Cycles & Resets

  • User allowances reset monthly on the user's billing date
  • Resetting an individual user in Manage Users sets used=0 without changing their total
  • Global hard limits (timeouts, tokens, memory) apply regardless of the user's remaining balance

Security & Roles

🔒 Critical Security Rules
  • Tokens are bearer credentials. Treat them like passwords.
  • Removed accounts cannot be re-created by any old token (we mark & block).
  • Terminate is reversible (set prompts > 0). Remove is final.

Best Practices

  • Prefer expiring tokens for contractors/trials
  • Rotate tokens if one is suspected compromised
  • Export logs regularly and store them off-box

Troubleshooting

User Can't Log In

  • Check System Logs for SYSTEM / Login errors
  • Confirm the token has not expired and the user's status is not Removed or Terminated
  • If Terminated, use Update Prompts to restore > 0

"Tokens Exceeded" or "Took Too Long" Errors

  • Raise MAX_TOTAL_TOKENS or MAX_OUTPUT_TOKENS gradually
  • If latency hits the 30s timeout: try lowering output caps or simplifying prompts

Import CSV Fails

  • Download the Example CSV and match headers exactly: user,maxPrompts,expires
  • Dates must be ISO (YYYY-MM-DD)
  • Keep under a few thousand rows for smooth UX; if you plan more, split files

Analytics Show 0 But Users Exist

  • Click Refresh
  • Check that /api/users returns data (network/auth)
  • If you recently changed Global Limits, wait a moment for the backend restart

Backup & Maintenance Tips

  • Logs are in your admin app's /logs directory. Snapshot them periodically
  • Use PM2 (or your process manager) to keep both Admin and Backend online
  • After OS updates or Node upgrades, validate Server Monitor shows both services healthy

Requirements (Self-Hosted)

Component | Requirement
Node.js   | v18.20.8+
CPU       | ≥ 2 cores (scale with traffic)
RAM       | ≥ 2 GB (4 GB recommended)
Disk      | ≥ 500 MB (plus headroom for logs/data)
Database  | MongoDB or MySQL
OS        | Linux (Ubuntu, Alma, Amazon Linux, etc.)

☁️ Recommended

AWS EC2 is a great fit for hosting NeuroForge.

Support, Licensing, and Customization

  • Your one-time license covers lifetime use and updates
  • We provide install, update, and usage guides for both the Admin Panel and the NeuroForge App
  • We also include developer docs (code structure and endpoints) for teams who want to extend NeuroForge

⚠️ Important

If you fork/modify core code, we can't guarantee support for the modified version. Keep a vanilla backup if you plan experiments.

🔒 License Integrity

License integrity must remain intact; tampering leads to a permanent ban.

Real-World Scenarios

A. "Client used up everything mid-demo."

Solution: Manage Users → search client → Update Prompts from 50 → 100 (keeps their used count intact), then click Reset Usage if you want to grant a clean slate.

B. "We offboarded a freelancer."

Solution: Manage Users → Terminate to pause immediately. If you need an irrevocable block (and to prevent token resurrection), Remove. Hide removed users via filter.

C. "We're shipping a long report; responses keep getting cut."

Solution: Global Limits → raise MAX_OUTPUT_TOKENS in steps of 250, keep MAX_TOTAL_TOKENS reasonable, and watch Server Monitor. If response times climb, back off by 100–200.

D. "We imported 100 users and ops wants a list of all their tokens."

Solution: Use Import CSV with your master list → validate → success → Download updated CSV with tokens → store securely (password-vault or encrypted share).

Conclusion

The NeuroForge Admin Panel gives you complete control over your self-hosted AI Battle Arena. Use it wisely, monitor regularly, and your users will have a seamless experience.

For technical support or custom development inquiries, contact your account manager or visit our support portal.