核心能力
- 网页浏览
- 元素交互
- 截图
- 信息提取
browse 让 AI 使用浏览器进行自动化操作。它的核心是:通过浏览器完成网页交互、自动化测试、信息抓取等任务。
作为程序员,你可能遇到过:
browse 就是解决这些的。
# 访问网页browse https://example.com
# 获取页面内容browse --content https://example.com# 点击按钮click "#submit-btn"
# 填写表单fill "#username", "myuser"fill "#password", "mypass"
# 选择下拉框select "#country", "CN"# 截图screenshot
# 元素截图screenshot "#modal"# 提取文本extract text from ".article-title"
# 提取链接extract links from "nav a"
# 提取表格extract table from ".data-table"你想测试用户注册流程:
1. 打开注册页 browse https://example.com/register
2. 填写表单 fill "#email", "test@example.com" fill "#password", "Password123!"
3. 点击注册 click "#register-btn"
4. 验证结果 wait for ".success-message"你想抓取新闻列表:
1. 打开新闻页 browse https://news.example.com
2. 提取标题 extract text from ".news-title"
3. 提取链接 extract links from ".news-item a"
4. 提取内容 extract text from ".news-content"你想验证页面 UI:
1. 打开页面 browse https://example.com
2. 截图 screenshot
3. 检查元素 exists "#modal" visible ".sidebar"
4. 检查样式 get css "#button", "background-color"# 打开页面open https://example.com
# 刷新refresh
# 返回back
# 前进forward
# 等待wait 3 secondswait for ".content"# 点击click "#button"click ".menu-item:first"
# 输入type "search query"fill "#input", "value"
# 选择select "#dropdown", "option-value"check "#checkbox"uncheck "#checkbox"
# 滚动scroll downscroll to bottomscroll to "#element"# 提取文本get textget text from ".title"
# 提取属性get href from "a"get src from "img"
# 提取 HTMLget htmlget html from ".container"1. 打开商品页 browse https://shop.example.com/product/123
2. 添加到购物车 click ".add-to-cart"
3. 打开购物车 click ".cart-icon"
4. 点击结算 click ".checkout"
5. 填写信息 fill "#name", "张三" fill "#phone", "13800138000" fill "#address", "北京市..."
6. 选择支付 select "#payment", "alipay"
7. 确认订单 click ".confirm-order"
8. 验证结果 wait for ".order-success"1. 打开目标网站 browse https://example.com
2. 检查关键元素 exists ".login-button" exists "#user-avatar"
3. 截图记录 screenshot "homepage.png"
4. 检查响应时间 measure load time# 动态内容需要等待wait for ".dynamic-content"
# 不要假设页面立即加载完成# 进入 iframeswitch to frame "iframe-name"
# 退出 iframeswitch to default# 处理 alertaccept alertdismiss alert
# 处理确认框accept confirmdismiss confirm核心能力
使用场景
常用命令
查看源文件: GitHub原始文件