Research Assistant AI Feature

Overview

The Research Assistant is an AI-powered writing assistant that helps authors find and reference information on topics they're writing about. It uses AI tool calling to autonomously search the web, fetch and parse relevant web pages, and synthesize the findings into a well-organized research summary with proper citations.

How It Works

User Input: Author provides a topic to research (either by typing or selecting text in the editor)
AI Tool Calling: The AI agent autonomously decides which tools to use:
- web_search - Search DuckDuckGo for relevant pages
- read_webpage - Fetch and extract content from a single URL
- fetch_top_pages - Batch fetch multiple URLs at once
AI Synthesis: OpenAI GPT-4o analyzes the collected information and produces:
- A comprehensive summary
- Key facts and findings
- Properly cited sources
- Suggested text for the author's document

Architecture

┌─────────────────┐     ┌──────────────────┐     ┌───────────────────┐
│  Editor/Modal   │────▶│   Controller     │────▶│  ResearchAgent    │
│  (Stimulus JS)  │◀────│  (Rails)         │◀────│  (ActiveAgent)    │
└─────────────────┘     └──────────────────┘     └───────────────────┘
                                                          │
                                                          ▼
                                                 ┌─────────────────┐
                                                 │    OpenAI       │
                                                 │    GPT-4o       │
                                                 │  (Tool Calling) │
                                                 └─────────────────┘
                                                          │
                              ┌────────────────┬──────────┴──────────┐
                              ▼                ▼                     ▼
                     ┌─────────────────┐ ┌─────────────────┐ ┌─────────────────┐
                     │   web_search    │ │  read_webpage   │ │ fetch_top_pages │
                     │  (DuckDuckGo)   │ │   (Nokogiri)    │ │   (Nokogiri)    │
                     └─────────────────┘ └─────────────────┘ └─────────────────┘

Files

Agent

app/agents/research_assistant_agent.rb - Main agent class with tool methods for web research

Views

app/views/research_assistant_agent/research.text.erb - Prompt template for research action
app/views/research_assistant_agent/instructions.text.erb - System instructions

Tool Schemas (JSON Views)

app/views/research_assistant_agent/tools/web_search.json.erb - Web search tool definition
app/views/research_assistant_agent/tools/read_webpage.json.erb - Page reader tool definition
app/views/research_assistant_agent/tools/fetch_top_pages.json.erb - Batch page fetcher tool definition

Controller

app/controllers/assistants_controller.rb - Added research action and streaming support

Routes

config/routes.rb - Added post "research" => "assistants#research"

Frontend

app/views/pages/_house_toolbar.html.erb - Added research button with search icon
app/javascript/controllers/ai_modal_controller.js - Added 'Researching topic...' label

Tests

test/agents/research_assistant_agent_test.rb - Unit tests for the agent

Usage

From the Editor Toolbar

Open a page in edit mode
Optionally select text you want to research (or leave empty to use full content as topic)
Click the "Research" button (magnifying glass with + icon) in the AI tools section
The AI modal will open and show "Researching topic..."
Wait for the research to complete
Review the findings, then:
- Click "Apply" to insert into document
- Click "Copy" to copy to clipboard
- Click "Discard" to close without changes

Programmatic Usage

# Non-streaming
result = ResearchAssistantAgent.with(
  topic: "climate change effects on coral reefs",
  context: "scientific article",
  depth: "standard"
).research.generate_now

# Streaming (via controller)
POST /assistants/stream
{
  "action_type": "research",
  "topic": "artificial intelligence in healthcare",
  "full_content": "Current document content...",
  "context": "Academic paper"
}

Configuration

The research agent uses the Ollama provider with the gpt-oss:20b model for local inference. Ensure Ollama is running:

# Start Ollama server
ollama serve

# Pull the model
ollama pull gpt-oss:20b

The tool schemas follow OpenAI's Chat API format for compatibility:

{
  "type": "function",
  "function": {
    "name": "tool_name",
    "description": "...",
    "parameters": {...}
  }
}

Web Search Implementation

The agent uses DuckDuckGo's HTML interface for searches (no API key required):

Search: Queries html.duckduckgo.com/html/ with URL-encoded topic
Parse Results: Extracts titles, URLs, and snippets using Nokogiri
Fetch Pages: Downloads top 3 result pages with browser-like headers
Extract Content: Removes navigation, ads, scripts; keeps main content
Limit Size: Truncates content to prevent token overflow

Output Format

The AI produces structured output:

### Summary
2-4 paragraphs summarizing key findings

### Key Facts
- Bullet points of important facts
- Statistics and specific details

### Sources
1. [Source Title](URL) - Brief description
2. [Another Source](URL) - What it contributes

### Suggested Text
Paragraphs the author can incorporate with inline citations

Limitations

Searches limited to 8 results, fetches top 3 pages
Page content truncated to 6000 characters each
Some pages may block automated fetching
Real-time/breaking news may not be immediately available
No image or video analysis

Future Enhancements

Add support for specific domain searches (e.g., Wikipedia only)
Implement caching of recent searches
Add depth levels (quick/standard/thorough)
Support for academic paper searches (arXiv, Google Scholar)
Multi-query refinement based on initial results
Fact-checking against multiple sources

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Research Assistant AI Feature

Overview

How It Works

Architecture

Files

Agent

Views

Tool Schemas (JSON Views)

Controller

Routes

Frontend

Tests

Usage

From the Editor Toolbar

Programmatic Usage

Configuration

Web Search Implementation

Output Format

Limitations

Future Enhancements

FilesExpand file tree

research-assistant.md

Latest commit

History

research-assistant.md

File metadata and controls

Research Assistant AI Feature

Overview

How It Works

Architecture

Files

Agent

Views

Tool Schemas (JSON Views)

Controller

Routes

Frontend

Tests

Usage

From the Editor Toolbar

Programmatic Usage

Configuration

Web Search Implementation

Output Format

Limitations

Future Enhancements