File size: 1,853 Bytes
cf93ac3 513a2d7 cf93ac3 513a2d7 cf93ac3 513a2d7 cf93ac3 513a2d7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 |
---
title: Open Computer Use Agent
emoji: π₯οΈ
colorFrom: blue
colorTo: purple
sdk: docker
app_port: 7860
pinned: false
license: mit
---
# Open Computer Use Agent
An open-source alternative to OpenAI's Operator and Anthropic's Computer Use.
## Features
- π₯οΈ Full Linux desktop (Xfce) running in the browser
- π±οΈ Mouse control (click, double-click, right-click)
- β¨οΈ Keyboard input (typing, key combinations)
- π· Real-time screenshots
- π Firefox ESR browser included
## How It Works
This Space runs a virtual X11 desktop using:
- **Xvfb**: Virtual framebuffer for headless display
- **Xfce4**: Lightweight desktop environment
- **xdotool**: Mouse and keyboard automation
- **Gradio**: Web UI for control
## Usage
1. Click "Take Screenshot" to see the current desktop
2. Use the action controls to interact:
- Enter X,Y coordinates and click
- Type text
- Press keyboard shortcuts
- Scroll up/down
## Architecture
```
βββββββββββββββββββββββββββββββββββ
β HuggingFace Spaces Container β
β βββββββββββββ ββββββββββββββ β
β β Xvfb ββββ Xfce4 β β
β β :99 β β Desktop β β
β βββββββ¬ββββββ ββββββββββββββ β
β β β
β βββββββΌββββββ ββββββββββββββ β
β β xdotool ββββ Gradio β β
β β control β β UI :7860 β β
β βββββββββββββ ββββββββββββββ β
βββββββββββββββββββββββββββββββββββ
```
## License
MIT
|