LangMetres is a powerful web application for evaluating and comparing different language models. It provides a unified interface for testing multiple models simultaneously while tracking costs, tokens, and performance metrics. Thank to LiteLLM, it supports a wide range of LLM providers.
langmetres-480.mov
- Multi-Model Testing: Test multiple language models side by side
- Real-time Metrics: Track token usage, costs, and latency for each response
- Model Configuration: Adjust temperature and top_p settings per model
- Response Comparison: View responses in both markdown and raw formats
- Pass/Fail Tracking: Mark successful responses and track evaluation results
- Import/Export: Save and load test results in Excel format
- Environment Management: Easy configuration of API keys and settings
- Local Storage: Automatically saves test cases and configurations
The easiest way to run LangMetres is using Docker:
docker run -it --rm --name langmetres -p 1905:1905 spksoft/langmetres:latest
Then visit http://localhost:1905
in your browser.
Configure your API keys in the Environment Variables tab. LangMetres supports various LLM providers through LiteLLM. See the LiteLLM documentation for provider-specific configurations.
Example configuration:
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-...
-
Set Environment Variables
- Add your API keys in the Environment Variables tab
- Save to localStorage if desired
-
Select Models
- Choose models from the dropdown
- Configure temperature and top_p for each model
-
Create Test Cases
- Enter your test prompts
- Run tests against all selected models
- View and compare responses
- Mark successful responses with PASS button
-
Export/Import Results
- Export test results to Excel
- Import previous test results
- Results include all metrics and PASS/FAIL status
Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.
- Clone the repository:
git clone https://github.com/spksoft/langmetres.git
cd langmetres
- Install dependencies:
# Frontend
cd frontend
npm install
# Backend
cd ../backend
poetry install
- Run the development servers:
# Frontend
npm run dev
# Backend
poetry run start
If you have any questions or run into issues, please open an issue in the GitHub repository.
Made with ❤️ and 🤖 by @spksoft
Current Version: 0.1.0-alpha.3
MIT