top of page

AI Agent for Webpage Content Extraction

An n8n workflow combining LangChain AI (ReAct agent) and OpenAI GPT-4 to fetch, clean, and convert webpage HTML content to Markdown. It uses HTTP requests, regex for HTML body extraction, and options to simplify or fully preserve content for intelligent web data retrieval and processing.

bottom of page