
Building an AI Research Agent with Firecrawl

· 8 min read
Karan
Software Developer

The challenge facing modern AI applications isn't the intelligence itself. It's feeding that intelligence with clean, structured data. Anyone who has tried scraping the web at scale knows the pain: navigating dynamic JavaScript rendering, managing rate limits, dealing with CAPTCHA walls, and parsing inconsistent HTML structures. These aren't trivial problems.

This is where purpose-built tooling makes the difference. In this guide, we'll build a research agent that autonomously searches the web, extracts structured content, and synthesizes findings using an LLM. Think of it as a proof-of-concept for more ambitious systems like market intelligence dashboards, competitive analysis tools, or automated research assistants.

The Architecture​

Our agent follows a three-phase pipeline:

Discovery - Search for relevant sources based on a query
Extraction - Convert raw web pages into structured, LLM-ready text
Synthesis - Aggregate and analyze the extracted data

This pattern scales well. The same core loop powers everything from simple Q&A bots to sophisticated autonomous research systems.

What You'll Need​

  • Node.js (v18 or later recommended)
  • Firecrawl API key from firecrawl.dev
  • OpenAI API key for the analysis layer

Firecrawl handles the heavy lifting of web scraping - JavaScript rendering, proxy rotation, and anti-bot evasion - so you can focus on building features rather than fighting infrastructure.

Project Setup​

Initialize your project and install dependencies:

mkdir research-agent
cd research-agent
npm init -y
npm install @mendable/firecrawl-js openai dotenv

Create a .env file for your credentials:

FIRECRAWL_API_KEY=fc-YOUR_KEY_HERE
OPENAI_API_KEY=sk-YOUR_KEY_HERE

Security note: Never commit API keys to version control. Use a secrets manager for production deployments.

Building the Agent​

Phase 1: Discovery Layer​

Rather than manually curating URLs, we'll use Firecrawl's search capability to dynamically discover relevant sources. This makes the agent adaptable to any query without hardcoded assumptions.

import Firecrawl from '@mendable/firecrawl-js';
import dotenv from 'dotenv';

dotenv.config();

const firecrawl = new Firecrawl({ apiKey: process.env.FIRECRAWL_API_KEY });

async function searchTopic(query, maxResults = 3) {
  console.log(`🔍 Searching for: "${query}"...`);

  const searchResult = await firecrawl.search(query, { limit: maxResults });

  if (!searchResult.success || !searchResult.data?.length) {
    throw new Error(`No results found for query: ${query}`);
  }

  const urls = searchResult.data.map(item => item.url);
  console.log(`Found ${urls.length} sources`);

  return urls;
}

The search API returns ranked results, similar to what you'd see in a traditional search engine. For production use, consider implementing relevance filtering or diversity checks to avoid redundant sources.
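A lightweight diversity check is deduplicating results by hostname, so the agent doesn't synthesize three pages from the same site. Here's a minimal sketch; the `diversifySources` helper and the per-domain cap are illustrative post-processing, not part of the Firecrawl API:

```javascript
// Keep at most N results per hostname so sources stay diverse.
function diversifySources(urls, maxPerDomain = 1) {
  const seen = new Map(); // hostname -> count of kept results
  return urls.filter((url) => {
    const host = new URL(url).hostname;
    const count = seen.get(host) ?? 0;
    if (count >= maxPerDomain) return false;
    seen.set(host, count + 1);
    return true;
  });
}

const candidates = [
  'https://example.com/a',
  'https://example.com/b',
  'https://other.dev/post'
];
const diverse = diversifySources(candidates);
// diverse keeps the first example.com result and the other.dev result
```

Because `filter` preserves ranking order, the highest-ranked page from each domain survives.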

Phase 2: Extraction Layer​

Web scraping is deceptively complex. Modern sites render content via JavaScript, employ anti-bot measures, and vary wildly in structure. Firecrawl abstracts this complexity, returning clean Markdown that preserves semantic structure without HTML noise.

async function scrapeContent(urls) {
  console.log(`🕷️ Scraping ${urls.length} pages...`);

  const scrapePromises = urls.map(async (url) => {
    try {
      const result = await firecrawl.scrape(url, {
        formats: ['markdown']
      });

      if (result.success && result.markdown) {
        return {
          url,
          content: result.markdown,
          success: true
        };
      }
    } catch (err) {
      console.error(`Failed to scrape ${url}: ${err.message}`);
    }

    return { url, success: false };
  });

  const results = await Promise.all(scrapePromises);
  const successful = results.filter(r => r.success);

  console.log(`✓ Successfully scraped ${successful.length}/${urls.length} pages`);

  return successful.map(r =>
    `SOURCE: ${r.url}\n\n${r.content}\n\n---\n`
  ).join('\n');
}

Why Markdown? LLMs are trained on vast amounts of Markdown text from documentation, GitHub, and technical writing. The format preserves semantic hierarchy - headers, lists, code blocks - while remaining token-efficient, which matters when you're working with context window limits.

Phase 3: Synthesis Layer​

With structured data in hand, we can now leverage an LLM to synthesize findings. This is where the magic happens: transforming disparate sources into coherent, actionable insights.

import OpenAI from 'openai';

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

async function generateReport(topic, context) {
  console.log('🧠 Analyzing and synthesizing data...');

  const completion = await openai.chat.completions.create({
    model: 'gpt-4o-mini',
    temperature: 0.3, // Lower temperature for factual accuracy
    messages: [
      {
        role: 'system',
        content: 'You are a research analyst. Synthesize information from multiple sources into a clear, well-structured technical briefing. Cite sources when making specific claims.'
      },
      {
        role: 'user',
        content: `Research Topic: ${topic}\n\nGathered Information:\n\n${context}\n\nProvide a comprehensive summary with key findings and insights.`
      }
    ],
    max_tokens: 1500
  });

  return completion.choices[0].message.content;
}

Model selection matters here. We're using gpt-4o-mini for cost efficiency, but for production research tools, consider gpt-4o for improved reasoning and source citation accuracy.

Putting It All Together​

Here's the complete agent orchestrating all three phases:

import Firecrawl from '@mendable/firecrawl-js';
import OpenAI from 'openai';
import dotenv from 'dotenv';

dotenv.config();

const firecrawl = new Firecrawl({ apiKey: process.env.FIRECRAWL_API_KEY });
const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

async function researchAgent(topic, maxSources = 3) {
  const startTime = Date.now();

  try {
    // Phase 1: Discovery
    console.log(`\n🔍 Starting research on: "${topic}"\n`);
    const urls = await searchTopic(topic, maxSources);
    console.log(`Sources identified:\n${urls.map(u => `  • ${u}`).join('\n')}\n`);

    // Phase 2: Extraction
    console.log('🕷️ Extracting content from sources...\n');
    const context = await scrapeContent(urls);

    if (!context.trim()) {
      throw new Error('Failed to extract content from any sources');
    }

    // Phase 3: Synthesis
    console.log('🧠 Generating research report...\n');
    const report = await generateReport(topic, context);

    // Output results
    const duration = ((Date.now() - startTime) / 1000).toFixed(2);
    console.log('═'.repeat(60));
    console.log('RESEARCH REPORT');
    console.log('═'.repeat(60));
    console.log(`\n${report}\n`);
    console.log('═'.repeat(60));
    console.log(`✓ Research completed in ${duration}s\n`);

  } catch (error) {
    console.error(`\n❌ Research failed: ${error.message}`);
    process.exit(1);
  }
}

async function searchTopic(query, maxResults = 3) {
  const searchResult = await firecrawl.search(query, { limit: maxResults });

  if (!searchResult.success || !searchResult.data?.length) {
    throw new Error(`No results found for query: ${query}`);
  }

  return searchResult.data.map(item => item.url);
}

async function scrapeContent(urls) {
  const scrapePromises = urls.map(async (url) => {
    try {
      const result = await firecrawl.scrape(url, {
        formats: ['markdown']
      });

      if (result.success && result.markdown) {
        return `SOURCE: ${url}\n\n${result.markdown}\n\n---\n`;
      }
    } catch (err) {
      console.error(`  ⚠️ Failed to scrape ${url}`);
    }
    return null;
  });

  const results = await Promise.all(scrapePromises);
  const successful = results.filter(r => r !== null);

  console.log(`  ✓ Successfully scraped ${successful.length}/${urls.length} pages\n`);

  return successful.join('\n');
}

async function generateReport(topic, context) {
  const completion = await openai.chat.completions.create({
    model: 'gpt-4o-mini',
    temperature: 0.3,
    messages: [
      {
        role: 'system',
        content: 'You are a research analyst. Synthesize information from multiple sources into a clear, well-structured technical briefing. Cite sources when making specific claims.'
      },
      {
        role: 'user',
        content: `Research Topic: ${topic}\n\nGathered Information:\n\n${context}\n\nProvide a comprehensive summary with key findings and insights.`
      }
    ],
    max_tokens: 1500
  });

  return completion.choices[0].message.content;
}

// Execute the agent
const topic = process.argv[2] || 'Latest developments in WebAssembly';
researchAgent(topic);

Running the Agent​

Save the complete script above as agent.js, then execute it from your terminal:

node agent.js "What is retrieval-augmented generation?"

The agent will discover relevant sources, extract their content, and produce a synthesized report - all in one command.

Taking It to Production​

This implementation is a solid foundation, but production systems need additional work. Here's what to consider:

Error Handling - Implement exponential backoff for rate limits and transient failures. Consider circuit breakers for consistently failing sources.
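
For example, exponential backoff can be added with a small generic wrapper. This is a sketch, not part of the Firecrawl or OpenAI SDKs; the `withRetry` name and its options are illustrative:

```javascript
// Retry an async call with exponential backoff plus a little jitter.
// Wrap any flaky call (scrape, search, LLM request) in withRetry.
async function withRetry(fn, { retries = 3, baseMs = 500 } = {}) {
  let lastErr;
  for (let attempt = 0; attempt <= retries; attempt++) {
    try {
      return await fn();
    } catch (err) {
      lastErr = err;
      if (attempt === retries) break;
      const delay = baseMs * 2 ** attempt + Math.random() * 100; // 500ms, 1s, 2s...
      await new Promise((resolve) => setTimeout(resolve, delay));
    }
  }
  throw lastErr;
}

// Usage sketch:
// const result = await withRetry(() => firecrawl.scrape(url, { formats: ['markdown'] }));
```

Production code would also inspect the error (retry 429s and timeouts, fail fast on 401s) rather than retrying blindly.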

Observability - Add structured logging (Winston, Pino) and metrics to track success rates, latency, and token usage.

Cost Management - Monitor API usage closely. At scale, LLM calls dominate costs. Consider caching frequently requested topics or implementing tiered analysis: quick summaries vs. deep dives.
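
A topic cache can be as simple as an in-memory map with a TTL. The `ReportCache` class below is purely illustrative; a real deployment would likely reach for Redis or a database instead:

```javascript
// Tiny in-memory TTL cache for finished reports, keyed by topic.
class ReportCache {
  constructor(ttlMs = 60 * 60 * 1000) { // default: 1 hour
    this.ttlMs = ttlMs;
    this.store = new Map(); // topic -> { value, expires }
  }

  get(topic) {
    const entry = this.store.get(topic);
    if (!entry || entry.expires < Date.now()) return undefined;
    return entry.value;
  }

  set(topic, value) {
    this.store.set(topic, { value, expires: Date.now() + this.ttlMs });
  }
}

const cache = new ReportCache();
cache.set('webassembly', 'cached report text');
// Check the cache before running the full pipeline; on a hit, skip all API calls.
```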

Content Quality - Not all scraped content is equally useful. Implement filtering based on content length, language detection, or relevance scoring before sending to the LLM.
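
The simplest quality gate is a minimum-length filter applied to the page objects returned by scrapeContent's first version. The helper name and threshold below are illustrative; tune them for your domain:

```javascript
// Drop scraped pages that are too short to be useful
// before spending LLM tokens on them.
function filterByQuality(pages, minChars = 500) {
  return pages.filter(
    (p) => p.success && p.content && p.content.length >= minChars
  );
}

const usable = filterByQuality([
  { url: 'https://example.com/full', success: true, content: 'x'.repeat(600) },
  { url: 'https://example.com/stub', success: true, content: 'short' },
  { url: 'https://example.com/fail', success: false }
]);
// usable contains only the first page
```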

Rate Limiting - Both Firecrawl and OpenAI have rate limits. Implement request queuing and concurrency controls for batch operations.
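
Instead of firing every scrape at once with Promise.all, you can cap in-flight requests. This is a hand-rolled sketch of what libraries like p-limit provide; `mapWithConcurrency` is an illustrative name:

```javascript
// Run an async worker over items with at most `limit` tasks in flight.
async function mapWithConcurrency(items, limit, worker) {
  const results = new Array(items.length);
  let next = 0; // shared cursor across runners

  async function run() {
    while (next < items.length) {
      const i = next++;
      results[i] = await worker(items[i], i);
    }
  }

  const runners = Array.from({ length: Math.min(limit, items.length) }, run);
  await Promise.all(runners);
  return results;
}

// Usage sketch: scrape at most 2 pages at a time.
// const pages = await mapWithConcurrency(urls, 2, (url) => firecrawl.scrape(url, { formats: ['markdown'] }));
```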

Context Window Management - Large scrapes can exceed LLM context windows. Implement chunking strategies or use map-reduce patterns for processing extensive content.
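
A basic map-reduce pass looks like this: split the context, summarize each chunk, then synthesize the partial summaries. The character-based chunker approximates tokens (roughly 4 characters per token is a common rule of thumb), and `mapReduceReport` assumes the generateReport function defined earlier in this post:

```javascript
// Split scraped context into chunks that fit a model's context window.
function chunkText(text, maxChars = 12000) {
  const chunks = [];
  for (let i = 0; i < text.length; i += maxChars) {
    chunks.push(text.slice(i, i + maxChars));
  }
  return chunks;
}

// Map-reduce sketch: summarize each chunk (map), then
// synthesize the partial summaries into one report (reduce).
// Assumes generateReport(topic, context) from earlier in this post.
async function mapReduceReport(topic, context) {
  const chunks = chunkText(context);
  const partials = [];
  for (const chunk of chunks) {
    partials.push(await generateReport(topic, chunk)); // map step
  }
  return generateReport(topic, partials.join('\n\n')); // reduce step
}
```

A smarter chunker would split on source boundaries (the `---` separators) so a single page is never cut mid-thought.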

WebSocket Support - For long-running crawls, Firecrawl supports WebSocket connections that provide real-time updates. This is useful when you need to process documents as they arrive rather than waiting for the entire crawl to complete:

const { id } = await firecrawl.startCrawl('https://docs.example.com', {
  limit: 50
});

const watcher = firecrawl.watcher(id, {
  kind: 'crawl',
  pollInterval: 2
});

watcher.on('document', (doc) => {
  // Process each document as it arrives
  console.log('New document:', doc.url);
});

watcher.on('done', (state) => {
  console.log('Crawl complete:', state.status);
});

await watcher.start();

Pagination for Large Datasets - When dealing with extensive crawls, you can control pagination behavior to manage memory and processing:

// Auto-pagination with limits
const crawlLimited = await firecrawl.getCrawlStatus(jobId, {
  autoPaginate: true,
  maxPages: 5,
  maxResults: 100,
  maxWaitTime: 30
});

// Manual pagination for fine control
const crawlPage = await firecrawl.getCrawlStatus(jobId, {
  autoPaginate: false
});
// Process crawlPage.data, then fetch next page if crawlPage.next exists

Scaling the Pattern​

The Search - Scrape - Synthesize loop is remarkably flexible. Here are some real-world applications:

Competitive Intelligence - Monitor competitor websites for product changes, pricing updates, or new features. Schedule the agent to run daily and alert your team when significant changes are detected.

Market Research - Track industry trends by analyzing news sites, blogs, and technical forums. Aggregate insights weekly to inform strategic decisions.

Documentation Assistants - Aggregate and synthesize scattered documentation into coherent guides. This is particularly useful when working with microservices where documentation is spread across multiple repositories.

Fact-Checking Pipelines - Cross-reference claims across multiple authoritative sources. Useful for journalism, research validation, or content moderation.

Each application shares the same core architecture with domain-specific refinements. The key is identifying what makes sense to automate and what still requires human judgment.

Final Thoughts​

Building AI agents isn't about complex algorithms - it's about orchestrating the right tools effectively. By combining specialized APIs like Firecrawl with LLMs, you can build sophisticated research systems without reinventing web scraping infrastructure.

The agent we've built demonstrates the fundamental pattern, but it's just the starting point. The real power comes from iterating on this foundation: adding persistence, implementing feedback loops, or chaining multiple agents together for complex workflows.

Start simple, measure everything, and scale what works. That's how you build systems that last.

Introduction to Civo cloud and Civo kubernetes

· 3 min read
Karan
Software Developer

Civo Kubernetes is a managed Kubernetes service designed for speed, simplicity, and cost-effectiveness. Built on K3s, a lightweight Kubernetes distribution, Civo enables developers to deploy, manage, and scale applications effortlessly. Whether you are a startup, an enterprise, or an individual developer, Civo makes Kubernetes management seamless. 🌎

🤔 Why Choose Civo Kubernetes?

⚡ Super Fast Deployment

One of the biggest advantages of using Civo is how fast you can deploy a Kubernetes cluster. Traditional cloud providers like AWS, GCP, and Azure take several minutes to set up a cluster. With Civo, you can create a fully functional Kubernetes cluster in under 90 seconds! 🚀

Example: Creating a Kubernetes cluster using the Civo CLI:

civo k3s create my-cluster --size=g3.k3s.medium --region NYC1

In just a few moments, your cluster will be up and running! 🎉

💰 Budget-Friendly

Kubernetes can be expensive, but Civo offers a cost-effective alternative. Here's how:

  • Transparent pricing without hidden fees.
  • Uses K3s, which requires fewer resources than full-scale Kubernetes, reducing infrastructure costs.
  • Offers free credits for new users to try out the service. 💸

Example: Checking your cluster's cost estimate:

civo k3s show-cost my-cluster

🖥️ User-Friendly Interface

Civo makes Kubernetes accessible for everyone:

  • Simple CLI and Web UI for easy cluster management.
  • Pre-configured applications available in the marketplace (Databases, Monitoring, CI/CD tools, etc.).
  • Automated provisioning and monitoring.

For example, deploying a WordPress application from the marketplace is as easy as:

civo marketplace install wordpress --cluster=my-cluster

🏎️ Powered by K3s​

Civo Kubernetes is built on K3s, a lightweight Kubernetes distribution that requires fewer resources than traditional Kubernetes installations. This makes it ideal for small teams, developers, and startups who want an optimized Kubernetes experience without unnecessary overhead.

🎯 Built-in Marketplace​

Civo has a one-click marketplace that lets you deploy applications and tools without manually configuring them. Some popular apps available:

✅ Databases: PostgreSQL, MySQL, MongoDB
✅ Monitoring: Prometheus, Grafana
✅ Networking: Traefik, Nginx
✅ CI/CD: Jenkins, GitLab Runner

Example: Installing Prometheus for monitoring:

civo marketplace install prometheus --cluster=my-cluster

🌍 Who Should Use Civo Kubernetes?​

  • Startups & Developers 🚀: Get Kubernetes running quickly without managing complex infrastructure.
  • Businesses & Enterprises 🏢: Scale Kubernetes workloads efficiently while optimizing costs.
  • DevOps Engineers 🔧: Automate and manage infrastructure using Civo's CLI and APIs.

🔧 Getting Started with Civo Kubernetes

Ready to try Civo? Follow these steps:

1️⃣ Sign up at Civo.com and claim your free credits.
2️⃣ Install the CLI (optional but recommended):

brew install civo

3️⃣ Create a new Kubernetes cluster:

civo k3s create my-cluster

4️⃣ List your clusters:

civo k3s list

5️⃣ Access your cluster:

civo k3s config my-cluster --save
kubectl get nodes

🎯 Conclusion​

Civo Kubernetes is the perfect solution for those looking for a fast, affordable, and developer-friendly Kubernetes platform. Whether you're a beginner or an expert, Civo simplifies Kubernetes management so you can focus on building great applications. 🚀

🔗 Learn more: Civo Kubernetes Documentation 📚

Let's understand Kubernetes and How it works

· 4 min read
Karan
Software Developer

🔹 Introduction

Kubernetes (often abbreviated as K8s) is an open-source platform for automating deployment, scaling, and management of containerized applications. It helps developers manage their applications more efficiently and ensures they run reliably.

Think of Kubernetes as an orchestra conductor 🏒🎻. It ensures that all the containers (instruments) work together in harmony.

🌟 Why Kubernetes?​

Before Kubernetes, developers used tools like Docker to package applications into containers. But when you have hundreds or thousands of containers, managing them manually becomes a nightmare! 😡 That's where Kubernetes helps:

✅ Automatic Scaling – Adjusts the number of containers based on demand.
✅ Self-Healing – If a container crashes, Kubernetes restarts it automatically.
✅ Load Balancing – Distributes traffic among containers to prevent overload.
✅ Rolling Updates – Updates applications without downtime.
✅ Resource Optimization – Efficiently utilizes CPU and memory.
✅ Multi-Cloud Support – Works across AWS, GCP, Azure, and on-premises environments.

📌 Key Concepts in Kubernetes

1️⃣ Pods 🛶

A Pod is the smallest unit in Kubernetes. It contains one or more containers that share storage and networking. Think of a pod as a box 📦 that holds your application.

Example Pod YAML:​

apiVersion: v1
kind: Pod
metadata:
  name: my-pod
spec:
  containers:
    - name: my-container
      image: nginx
      ports:
        - containerPort: 80

2️⃣ Nodes 🖥️

A Node is a machine (physical or virtual) where Kubernetes runs your applications. A cluster consists of multiple nodes.

  • Master Node 👑: Controls and manages the cluster.
  • Worker Nodes 🏗️: Run the applications inside containers.

Each worker node contains:

  • Kubelet – Ensures pods are running correctly.
  • Container Runtime (Docker/Containerd) – Runs the actual containers.
  • Kube Proxy – Manages networking and communication between pods.

3️⃣ Deployments 🚀

A Deployment is used to manage multiple pods. It ensures your application is always running the desired number of instances and can handle updates smoothly.

Example Deployment YAML:​

apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-deployment
spec:
  replicas: 3 # Run 3 instances
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app
    spec:
      containers:
        - name: my-container
          image: nginx
          ports:
            - containerPort: 80

This creates a Deployment that runs 3 pods of NGINX.

4️⃣ Services 🌐

A Service allows communication between different components of an application (pods). It acts like a load balancer.

Example Service YAML:​

apiVersion: v1
kind: Service
metadata:
  name: my-service
spec:
  selector:
    app: my-app
  ports:
    - protocol: TCP
      port: 80
      targetPort: 80
  type: ClusterIP

This service ensures that the pods can communicate with each other inside the cluster.

🔧 How Kubernetes Works (Step-by-Step)

1️⃣ You define your application using YAML files (Pods, Deployments, Services).
2️⃣ Kubernetes API Server schedules and manages the workloads.
3️⃣ Nodes run the containers using a container runtime like Docker or Containerd.
4️⃣ Kubelet (agent on each node) ensures that pods are running as expected.
5️⃣ Kubernetes continuously monitors and maintains the system, ensuring stability.
6️⃣ Load balancing and auto-scaling adjust resources dynamically based on demand.

πŸ—οΈ Kubernetes Architecture​

Kubernetes follows a Master-Worker architecture:

  • Master Node Components: API Server, Scheduler, Controller Manager, etcd (database).
  • Worker Node Components: Kubelet, Kube Proxy, Container Runtime.

(Figure: Kubernetes architecture diagram)

🎯 Real-World Use Cases of Kubernetes​

✅ Running Scalable Web Applications – Deploy and manage websites with high traffic.
✅ Microservices Architecture – Easily manage multiple small services.
✅ CI/CD Pipelines – Automate software delivery workflows.
✅ Hybrid and Multi-Cloud Deployments – Run workloads on any cloud (AWS, GCP, Azure).
✅ Machine Learning & AI – Manage and deploy ML models efficiently.

🎉 Conclusion

Kubernetes makes it easier to deploy, scale, and manage modern applications efficiently. With features like auto-scaling, self-healing, and load balancing, it has become the standard for container orchestration. 🚀

If you're new to Kubernetes, try deploying a small app and explore its powerful capabilities! 💡

Smart Contracts and Rust: The Power Duo Driving Blockchain's Future

· 5 min read
Karan
Software Developer

Hey there, blockchain explorer! 🌍

Ever wondered how your favorite decentralized apps (dApps) run so smoothly or how your crypto transactions are magically secure? It all comes down to two incredible tech wonders: Smart Contracts and a programming language that's gaining massive popularity in the blockchain world - Rust.

Now, if you're scratching your head thinking, "Wait, why Rust? I thought Solidity was the cool kid in town?" - you're in the right place! This blog will walk you through what smart contracts really are, why Rust is becoming the go-to language for blockchain devs, and how the two are revolutionizing the decentralized universe.