Overview
Structify provides five powerful AI functions that automatically enhance your data by finding and organizing information from the web. Think of these as smart assistants that can scrape, structure, and find data for you.The Five Functions
structure_pdfs - Turn PDFs into Data Tables
What it does: Converts PDF documents into organized data tables
Best for: Processing reports, invoices, contracts, or any PDF with structured information
Example: “Turn these financial reports into a spreadsheet”
enhance_columns - Smart Data Enhancement ⭐
Most Popular Choice - No need to specify websites, automatically finds the best sources
enhance_relationships - Find Connected People/Entities
What it does: Discovers people or entities related to your data (like founders, employees, investors)
Best for: Building networks, finding key personnel, or mapping relationships
Example: “For each company, find their founders and key executives”
scrape_columns - Extract Single Values from Web Pages
What it does: Pulls specific information from web pages you already know about
Best for: Getting one piece of data per webpage (like a company description from their homepage)
Example: “Get the main product description from each company’s website”
scrape_relationships - Extract Lists from Web Pages
What it does: Finds and organizes lists of items from specific web pages
Best for: Getting multiple related items from known pages (like all job openings, all products, etc.)
Example: “From each company’s careers page, get all open positions”
Choosing the Right Approach
Start with enhance_columns (Recommended)
- When: You want to add information but don’t know exactly where to find it
- Why: It’s the smartest - automatically finds and uses the best sources
- Perfect for: General research and data enrichment
- **Tip: **When using this, give specific prompting for what meets criteria for your columns, and what the columns should look like
Combine for Best Results
The Power Combo: 
enhance_columns → scrape_relationships- First, use enhance_columnsto find relevant web pages
- Then, use scrape_relationshipsto extract detailed lists from those pages
- Start: List of companies
- Step 1: Use enhance_columnsto find their ‘Teams’ page URLs
- Step 2: Use scrape_relationshipsto get all employees listed on their website
When to Use Others
- enhance_relationships:When you need to find one-to-many connections- You have a list of companies, and you want to find their investors and board members
 
- scrape_columns:When you have specific URLs and need information for one row from each- Example: you’re building a people dataset, and have found a list of bio URLs for the people you’re interested
 
- scrape_relationships:When you have specific URLs with lists you want to extract- You want to grab information about the menus of all the restaurant URLs you have
 
Quick Decision Guide
Ask yourself:- “Do I already know which websites have the data?”
- No → Use enhance_columns
- Yes → Use scrape_columnsorscrape_relationships
- “Do I need one value or multiple values per row?”
- One value → Use enhance_columnsorscrape_columns
- Multiple values/lists → Use enhance_relationshipsorscrape_relationships
- “Am I looking for related people/entities?”
- Yes → Use enhance_relationships
- No → Use enhance_columns
Pro Tips
Start Smart
It’s helpful if you understand and describe how a human might solve the problem. For instance, “Navigate to this url, to find these urls, then use scrape_relationships to get all of those URLs, then use enhance_columns / enhance relationships to fill out the schema from those webpages’
Combine Functions
Use one function to find URLs, another to extract data for complex tasks. You can be fully creative in piecing together calls, for results tailored to your use case
Be Specific
The AI works better when you clearly describe what you want
Try First
Most tasks can be solved with just 
enhance_columns - try it first!
