@Preview(value="3.9.1") public class WebCrawlerSkill extends AbsSkill
该技能通过高度优化的爬虫驱动,将复杂的 HTML 页面转化为 LLM 友好的 Markdown 格式:
JINA(极简高效)与 FIRECRAWL(深度降噪)双引擎。User-Agent 与请求头,支持令牌(API Key)鉴权以获取更高级的抓取频率。| 限定符和类型 | 类和说明 |
|---|---|
static interface |
WebCrawlerSkill.CrawlerDriver |
| 限定符和类型 | 字段和说明 |
|---|---|
static WebCrawlerSkill.CrawlerDriver |
FIRECRAWL |
static WebCrawlerSkill.CrawlerDriver |
JINA |
| 构造器和说明 |
|---|
WebCrawlerSkill(WebCrawlerSkill.CrawlerDriver driver,
java.lang.String apiKey) |
| 限定符和类型 | 方法和说明 |
|---|---|
java.lang.String |
crawl(java.lang.String url) |
java.lang.String |
description()
获取技能描述
|
java.lang.String |
name()
获取技能名称(默认类名)
|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitgetInstruction, isSupported, onAttachpublic static final WebCrawlerSkill.CrawlerDriver JINA
public static final WebCrawlerSkill.CrawlerDriver FIRECRAWL
public WebCrawlerSkill(WebCrawlerSkill.CrawlerDriver driver, java.lang.String apiKey)
public java.lang.String name()
Skillpublic java.lang.String description()
Skill@ToolMapping(name="crawl_url", description="\u8bfb\u53d6\u5e76\u5206\u6790\u6307\u5b9a URL \u7f51\u9875\u7684\u8be6\u7ec6\u5185\u5bb9") public java.lang.String crawl(@Param(value="url") java.lang.String url)