What Are Sources?
Sources provide context to your agents - they’re documents, files, and links that help your agent make informed decisions.The Power of Context
Without sources, agents only know what’s in their instructions. With sources, agents can reference thousands of pages of documentation, policies, examples, and data.
Why Sources Matter
Domain Knowledge
Upload company policies, SOPs, pricing guides - your agent becomes an expert
Examples
Show your agent past quotes, emails, reports - it learns your style
Live Data
Monitor competitor websites, documentation, news - stay up to date
Verification
Agents cite specific pages and sections - you can verify their reasoning
Types of Sources
Decisional supports a wide variety of source types:Document Sources
- PDFs
- Office Documents
- Images
- Text Files
Best for: Contracts, invoices, forms, reports, manualsSupported features:
- Text extraction (native PDF text)
- OCR (scanned documents and images)
- Tables and structured data
- Multi-page documents
- Bounding box citations
- Upload past quotes so agent can match your formatting
- Upload pricing catalogs for product lookup
- Upload contracts for data extraction
- Upload SOPs for policy compliance
Web Sources
Web Links (URLs)
Web Links (URLs)
How it works: Decisional fetches the webpage content and indexes it.Best for:
- Company websites
- Documentation pages
- Blog posts
- Product pages
- Competitor pricing
- Automatic refresh (daily, weekly, or custom)
- Change detection (notify when page changes)
- JavaScript rendering (for dynamic sites)
API Endpoints
API Endpoints
How it works: Decisional calls your API and uses the response as context.Best for:
- Live data from internal systems
- Third-party API data
- Real-time information
- API keys
- Bearer tokens
- OAuth 2.0
Adding Sources to Your Agent
1
Navigate to Sources
In your agent page, click Sources in the left sidebar
2
Choose Source Type
Click Add Source and select:
- Upload File (PDF, Word, Excel, images, text)
- Add Link (webpage URL)
- Connect Integration (Google Drive, OneDrive, etc.)
3
Upload or Configure
For files: Drag and drop or browse to selectFor links: Paste the URL and set refresh frequencyFor integrations: Authorize and select files/folders
4
Name Your Source
Give it a descriptive name (e.g., “Q4 2024 Price List” instead of “pricelist_v3_final.xlsx”)Why: You’ll reference this name in agent instructions
5
Wait for Processing
Decisional indexes your source. This takes:
- Small PDFs: 10-30 seconds
- Large documents: 1-3 minutes
- Web pages: 30-60 seconds
6
Reference in Instructions
Tell your agent when to use this source:
How RAG Works
Decisional uses Retrieval-Augmented Generation (RAG) - a state-of-the-art technique for document understanding.The RAG Pipeline
1
Indexing
When you upload a document:
- Text extraction: Extract all text from the document
- Chunking: Split into semantic chunks (paragraphs, sections)
- Embedding: Convert text into vector representations
- Indexing: Store in a semantic search index
2
Retrieval
When your agent runs:
- Query: Agent formulates a search query based on its task
- Search: Find the most relevant chunks from your sources
- Ranking: Rank chunks by relevance
- Selection: Select top K chunks (typically 5-10)
3
Generation
Agent uses retrieved context:
- Context: Relevant chunks are provided to the agent
- Reasoning: Agent processes task using context
- Citation: Agent notes which sources it used
- Output: Agent generates result with citations
Bounding Box Citations
For PDFs, Decisional provides exact location citations:Page Number
Which page was referenced
Bounding Box
Exact location on the page (highlighted region)
Quoted Text
The specific text the agent read
- Verification: You can check the agent didn’t hallucinate
- Auditing: Understand exactly what the agent learned
- Compliance: Prove decision-making is based on real data
Click any citation in the run logs to see the exact PDF location highlighted!
Optimizing Source Quality
Better sources = better agent performance.Document Preparation Tips
Use Text-Based PDFs (Not Scans)
Use Text-Based PDFs (Not Scans)
Best: Native PDF with selectable textOkay: Scanned PDF (Decisional will OCR it, but slower and less accurate)How to check: Try to select text in the PDF. If you can, it’s text-based.If you only have scans:
- Use high-resolution scans (300 DPI minimum)
- Ensure good contrast
- Avoid skewed or rotated pages
Remove Irrelevant Content
Remove Irrelevant Content
Remove:
- Cover pages with no useful info
- Blank pages
- Unrelated appendices
- Repetitive boilerplate
Use Clear Formatting
Use Clear Formatting
Good structure:
- Clear headers and sections
- Numbered lists
- Tables with headers
- Consistent fonts
- Text in images (unless necessary)
- Watermarks that obscure text
- Heavy redactions
- Multi-column layouts (if possible)
Split Large Documents
Split Large Documents
Instead of: One 500-page policy manualDo this: Separate PDFs for each policy (Pricing, Returns, Shipping, etc.)Why:
- Faster processing
- More accurate retrieval (agent knows which document to search)
- Easier to update individual policies
Use Descriptive Filenames
Use Descriptive Filenames
Bad:
document.pdffinal_FINAL_v3.pdfIMG_2024.png
Acme_Corp_Pricing_Policy_2024_Q4.pdfInvoice_Template_Standard.pdfProduct_Catalog_Hardware.pdf
Source Organization Strategies
By Workflow Type
- Quote Generation
- Invoice Processing
- Contract Extraction
- CRM Enrichment
Sources to upload:
-
Price Lists (Excel or PDF)
- Current pricing
- Discount tiers
- Product catalog
-
Quote Templates (PDF)
- Past successful quotes
- Shows formatting, tone, structure
-
Pricing Policy (PDF or Word)
- Rules for discounts
- Approval thresholds
- Special terms
Advanced RAG Techniques
Multi-Document Reasoning
Agents can combine information from multiple sources: Example workflow:Temporal Sources
For time-sensitive information, set refresh schedules:1
Add Web Link
Add a URL (e.g., competitor pricing page)
2
Set Refresh Schedule
Choose frequency:
- Hourly (for real-time monitoring)
- Daily (for regular updates)
- Weekly (for less frequent changes)
- Manual (refresh when needed)
3
Enable Change Detection
Get notified when the page changes
Versioned Sources
Keep multiple versions of the same source: Example:Price List 2024 Q1.xlsxPrice List 2024 Q2.xlsxPrice List 2024 Q3.xlsxPrice List 2024 Q4.xlsx(current)
Integration-Based Sources
Connect live data sources through integrations:- Google Drive
- Email Attachments
- Databases
Setup:
- Connect Google Drive integration
- In Sources, click Add from Google Drive
- Select files or folders
- Choose sync frequency
- Two-way sync (changes in Drive appear in Decisional)
- Folder monitoring (new files auto-added)
- Permissions respected (only files you have access to)
Managing Sources
Source Library
View all sources across your workspace:1
Navigate to Sources
Click Sources in the main sidebar (not agent-specific)
2
View All Sources
See all uploaded documents, links, and connected files
3
Organize
- Search by name
- Filter by type (PDF, link, integration)
- Sort by upload date, size, usage
4
Reuse
Attach existing sources to new agents (no need to re-upload)
Source Actions
Update a Source
Update a Source
For files: Upload new version (overwrites old one)For links: Click Refresh Now to re-fetchFor integrations: Sync automatically based on scheduleAll agents using this source immediately see the update.
Delete a Source
Delete a Source
Warning: Deleting a source affects all agents using it.To delete:
- Click the source
- Click Delete
- Confirm
View Source Usage
View Source Usage
See which agents use a source:
- Click the source
- View Used By tab
- See list of agents
Download Original
Download Original
Retrieve the original file you uploaded:
- Click the source
- Click Download Original
Troubleshooting Sources
Source won't upload
Source won't upload
Common causes:
-
File too large (limit: 50MB for PDFs, 20MB for images)
- Solution: Split into smaller files or compress
-
Unsupported format
- Solution: Convert to supported format (e.g., DOC → DOCX)
-
Corrupted file
- Solution: Re-export or re-save the file
-
Network timeout
- Solution: Try again with better internet connection
Agent isn't using my source
Agent isn't using my source
Debugging steps:
-
Check source is attached to agent
- Go to Sources tab in agent page
- Verify source is listed
-
Reference source in instructions
- Explicitly mention: “Use [source name] to find…”
-
Verify source is indexed
- Check for “Indexed” status (not “Processing”)
-
Check source quality
- Is text extractable? Try selecting text in PDF
- Is content relevant to agent’s task?
-
Review run logs
- See if agent attempted to search source
- Check retrieval results
Wrong information retrieved
Wrong information retrieved
Problem: Agent cites irrelevant sectionsSolutions:
-
Improve source quality
- Remove irrelevant pages
- Split into focused documents
-
Be more specific in instructions
- Instead of: “Look up pricing”
- Do: “Find product X in ‘Price List’ and return the ‘Unit Price’ column value”
-
Add examples
- Show agent what good results look like
-
Check for conflicting sources
- Multiple sources with similar content can confuse agent
- Remove duplicates or be explicit which to use
OCR isn't working
OCR isn't working
Problem: Agent can’t read scanned PDFs or imagesSolutions:
- Use higher resolution (300 DPI minimum)
- Improve contrast (black text on white background is best)
- Straighten pages (no skew or rotation)
- Avoid handwriting (OCR works best on printed text)
- Try native PDF (if possible, use text-based PDF instead of scan)
Best Practices
Quality over quantity: 10 highly relevant documents > 100 generic documents
Name descriptively: “2025_Q1_Pricing” > “doc_v3_final.pdf”
Keep sources updated: Set refresh schedules or upload new versions regularly
Reference explicitly: Tell agents which sources to use and when
Verify citations: Check agent’s cited sources in run logs to ensure accuracy
Organize logically: Group related sources, delete outdated ones
Test with edge cases: Upload challenging documents to see how well RAG handles them