Agent Types
LiteAgent supports multiple web automation agents, each with unique characteristics and approaches to web interaction.Supported Agents Comparison
| Agent | Type | Vision | API Required | Dark Pattern Detection | Performance |
|---|---|---|---|---|---|
| BrowserUse | AI-Powered | ✅ | OpenAI/Anthropic | High | Fast |
| DoBrowser | Extension | ✅ | DoBrowser API | Medium | Medium |
| MultiOn | Multi-modal | ✅ | MultiOn API | High | Fast |
| Agent E | Enterprise | ✅ | Anthropic | Very High | Medium |
| Skyvern | Computer Vision | ✅ | Skyvern API | High | Slow |
| WebArena | Research | ❌ | OpenAI | Low | Fast |
| VisualWebArena | Research | ✅ | OpenAI | Medium | Medium |
| Human | Manual | ✅ | None | Baseline | Variable |
Agent Architecture
All agents inherit from theWebAutomationBase class, ensuring consistent behavior:
Individual Agent Details
- BrowserUse
- DoBrowser
- MultiOn
- Agent E
- Skyvern
BrowserUse Agent
Type: AI-powered browser automation Provider: Open-source library Models: GPT-4, Claude, local LLMsKey Features
- Natural language understanding
- Visual recognition capabilities
- Automatic error recovery
- Multi-step reasoning
Configuration
Strengths
- Excellent at understanding complex instructions
- Handles dynamic content well
- Good dark pattern detection
- Robust error handling
Limitations
- Requires API credits
- Can be slower for simple tasks
- May hallucinate interactions
Best Use Cases
- Complex multi-step workflows
- Sites with dynamic content
- Testing natural language understanding
- Dark pattern susceptibility testing
Agent Selection Guide
Choose BrowserUse when:
- You need natural language understanding
- Testing complex multi-step tasks
- Evaluating AI agent capabilities
- Cost-effective API usage
Choose DoBrowser when:
- Testing requires real browser environment
- Need cookie/session persistence
- Testing with browser extensions
- Simulating authenticated users
Choose MultiOn when:
- Dealing with visual-heavy interfaces
- Need cross-site automation
- Testing e-commerce flows
- Requiring robust form handling
Choose Agent E when:
- Maximum dark pattern detection needed
- Complex reasoning required
- Need detailed execution traces
- Enterprise-grade reliability
Choose Skyvern when:
- Testing non-standard interfaces
- Pure visual approach needed
- Canvas or WebGL applications
- Cross-platform testing
Performance Benchmarks
| Agent | Avg Task Time | Success Rate | DP Detection | Resource Usage |
|---|---|---|---|---|
| BrowserUse | 45s | 85% | 78% | Medium |
| DoBrowser | 60s | 75% | 65% | High |
| MultiOn | 40s | 88% | 80% | Low |
| Agent E | 90s | 92% | 95% | Medium |
| Skyvern | 120s | 70% | 82% | High |
Adding Custom Agents
To add a new agent to LiteAgent:- Create agent class inheriting from
WebAutomationBase:
- Register in factory:
- Add Docker support:
- Update configuration:
Agent Capabilities Matrix
| Capability | BrowserUse | DoBrowser | MultiOn | Agent E | Skyvern |
|---|---|---|---|---|---|
| Click Elements | ✅ | ✅ | ✅ | ✅ | ✅ |
| Type Text | ✅ | ✅ | ✅ | ✅ | ✅ |
| Screenshots | ✅ | ✅ | ✅ | ✅ | ✅ |
| Scroll | ✅ | ✅ | ✅ | ✅ | ✅ |
| Hover | ✅ | ✅ | ✅ | ✅ | ⚠️ |
| Drag & Drop | ⚠️ | ✅ | ✅ | ⚠️ | ❌ |
| File Upload | ✅ | ✅ | ⚠️ | ✅ | ❌ |
| iFrames | ✅ | ✅ | ✅ | ✅ | ⚠️ |
| Popups | ✅ | ✅ | ✅ | ✅ | ⚠️ |
| Multi-tab | ⚠️ | ✅ | ✅ | ⚠️ | ❌ |
Next Steps
Dark Patterns
Learn about the dark patterns agents are tested against
Agent Setup
Detailed setup instructions for each agent
Running Tests
Start testing agents with LiteAgent
