Parsera
parseraParsera is a lightweight Python library for scraping websites using large language models (LLMs).
Acciones disponibles (2)
Cada acción es una operación que el agente puede ejecutar contra este conector. Haz clic en una acción para ver sus parámetros.
Extract MarkdownPARSERA_EXTRACT_MARKDOWNAcciónTool to extract markdown content from a file or url.
PARSERA_EXTRACT_MARKDOWNAcciónTool to extract markdown content from a file or url.
Parámetros de entrada
urlstringURL of the page to extract markdown from.
file_pathstringLocal path to the document file to be uploaded for extraction.
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not
Parse Content with ParseraPARSERA_PARSE_CONTENTAcciónTool to parse and extract structured data from provided html or text. use after obtaining raw content.
PARSERA_PARSE_CONTENTAcciónTool to parse and extract structured data from provided html or text. use after obtaining raw content.
Parámetros de entrada
promptstringPrompt instructions for parsing.
contentstringObligatorioThe raw content or document to be parsed.
optionsobjectAdditional parsing options as a JSON object.
attributesobjectAttributes or instructions to guide parsing (e.g., filter by invoice type).
content_typestringObligatorioThe MIME type or type identifier of the content (e.g., 'text/html', 'application/pdf').
output_formatstringenumDesired format of the parsed output.
jsonxml
Parámetros de salida
dataobjectObligatorioData from the action execution
errorstringError if any occurred during the execution of the action
successfulbooleanObligatorioWhether or not the action execution was successful or not