
Extract outcomes from narrative reports

impact-measurement · intermediate · emerging

The problem

You've got 50 project reports in Word documents, each describing impact in narrative form ('participants showed increased confidence', 'families accessed better housing'). You need to aggregate this for your annual report or funder, but extracting and counting outcomes manually would take weeks. The data exists but it's trapped in prose.

The solution

Use an LLM to read your narrative reports and extract structured outcome data. Tell it what outcomes you track (wellbeing improved, employment gained, skills learned), and it pulls out: what outcomes were achieved, for how many people, and the evidence cited. What was scattered across 50 documents becomes a dataset you can count and analyze.

What you get

A structured spreadsheet with columns: report name, outcome type, number of beneficiaries, evidence/method, confidence score. You can now answer questions like 'How many people improved wellbeing across all our projects?' or 'Which projects achieved employment outcomes?' The narrative becomes queryable data.

Before you start

  • Project reports or evaluations in digital format (PDF, Word, or text) - the example script processes .txt files, so convert Word/PDF to plain text first
  • A defined outcomes framework - what outcomes you're looking for
  • An OpenAI or Anthropic API key for batch processing
  • Basic Python skills or willingness to adapt example code
  • IMPORTANT: Anonymise or de-identify reports before processing - narrative reports often contain beneficiary PII or case studies. Check your data processing agreements with AI providers cover this use case
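As a quick safety net before uploading anything, a sketch like the following can flag reports that still contain obvious PII patterns. The patterns and helper name are illustrative, and this is no substitute for proper anonymisation - it only catches common slips:

```python
import re

# Illustrative pre-flight check: flag reports containing obvious PII
# patterns (emails, UK-style phone numbers, postcodes) before they are
# sent to an external API. Patterns here are examples, not exhaustive.
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "phone": re.compile(r"\b(?:0\d{4}\s?\d{6}|07\d{3}\s?\d{6})\b"),
    "postcode": re.compile(r"\b[A-Z]{1,2}\d[A-Z\d]?\s?\d[A-Z]{2}\b"),
}

def find_pii(text):
    """Return a dict of pattern name -> matches found in the text."""
    hits = {name: pat.findall(text) for name, pat in PII_PATTERNS.items()}
    return {name: matches for name, matches in hits.items() if matches}

# A report like this would be flagged before upload
flagged = find_pii("Contact Jane on jane@example.org, postcode SW1A 1AA.")
```

Run this over every file first and hold back anything flagged for manual redaction.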

When to use this

  • You have many narrative reports and need aggregate outcome data
  • You're writing annual reports or impact summaries
  • Funders want outcome numbers but you've only got stories
  • Manual extraction would take longer than you have

When not to use this

  • You only have a few reports - quicker to extract manually
  • Reports don't mention outcomes clearly - AI can't extract what isn't there
  • You need precise numbers for statutory reporting - validate carefully
  • Your outcomes framework is unclear or inconsistent across projects

Steps

  1. Define your outcomes framework

    List the outcomes you track across projects: 'increased wellbeing', 'gained employment', 'improved housing situation', 'reduced isolation', 'developed skills'. Be specific and use consistent language. This is what you'll ask the AI to extract.

  2. Test extraction on sample reports

    Take 3-5 reports and manually identify what outcomes they mention. Then ask Claude or ChatGPT to extract outcomes from the same reports. Compare: did it find what you found? Did it miss anything? Did it hallucinate outcomes not present?
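To make the comparison concrete, simple set operations show agreement, misses, and possible hallucinations. The outcome names here are illustrative:

```python
# Compare the outcome types you found manually against what the model
# returned for the same report (example values, not real results)
manual = {"wellbeing", "employment", "confidence"}
extracted = {"wellbeing", "employment", "housing"}

missed = manual - extracted        # outcomes the model failed to find
hallucinated = extracted - manual  # outcomes not actually in the report
agreed = manual & extracted

print(f"Agreed: {sorted(agreed)}")
print(f"Missed: {sorted(missed)}")
print(f"Possibly hallucinated: {sorted(hallucinated)}")
```

Anything in the "missed" or "hallucinated" sets tells you what to fix in the next step.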

  3. Refine your extraction prompt

    Based on the test, improve your prompt. Emphasize: only extract outcomes explicitly stated (don't infer), include the evidence/method if mentioned (survey, interview, observation), note confidence (was it 'all participants' or 'some participants'?), flag vague claims.

  4. Convert reports to text

    Get your reports into text format the AI can read. Word docs and plain PDFs work fine. Scanned PDFs need OCR first. Keep original formatting reasonable - tables and bullet points help the AI understand structure.

  5. Run batch extraction

    Use the API and example code to process all reports. The script reads each report, extracts structured outcome data, and builds a CSV. For 50 reports this might take 30-60 minutes. Monitor a few to check quality stays consistent.

  6. Validate the results

    Spot-check extractions against original reports. Did the AI accurately capture what was claimed? Are numbers correct? Is confidence scoring sensible? Check at least 20% of your reports. Look for patterns in errors - maybe the AI struggles with one outcome type.
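Drawing the spot-check sample with pandas, grouped by outcome type so rarer outcomes are represented, can look like this. The toy DataFrame stands in for extracted_outcomes.csv, and groupby sampling needs pandas 1.1 or later:

```python
import pandas as pd

# Toy stand-in for the CSV produced by the extraction script
df = pd.DataFrame({
    "report_name": [f"report_{i}" for i in range(20)],
    "outcome_type": ["wellbeing"] * 10 + ["employment"] * 10,
})

# 20% per outcome type; fix the seed so the sample is reproducible
sample = df.groupby("outcome_type").sample(frac=0.2, random_state=0)
print(sample)
```

Check each sampled row against the original report text, not just against your memory of it.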

  7. Clean and aggregate

    You'll have some inconsistencies - 'employment' vs 'gained work', 'wellbeing' vs 'mental health'. Standardise these in your spreadsheet. Then you can aggregate: total beneficiaries per outcome type, which projects achieved which outcomes, evidence methods used.
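A sketch of the clean-up step in pandas: map naming variations onto canonical labels, then total beneficiary numbers, counting "not specified" entries separately rather than guessing at them. The synonym map and toy data are illustrative:

```python
import pandas as pd

# Illustrative synonym map - extend with the variations you actually see
SYNONYMS = {"gained work": "employment", "mental health": "wellbeing"}

df = pd.DataFrame({
    "outcome_type": ["employment", "gained work", "wellbeing", "mental health"],
    "number_of_people": ["12", "8", "45", "not specified"],
})

df["outcome_type"] = df["outcome_type"].replace(SYNONYMS)
# Non-numeric entries ("not specified") become NaN and drop out of sums
df["n"] = pd.to_numeric(df["number_of_people"], errors="coerce")

totals = df.groupby("outcome_type")["n"].sum(min_count=1)
unspecified = df["n"].isna().groupby(df["outcome_type"]).sum()
print(totals)
print(unspecified)
```

Report the unspecified counts alongside the totals so readers know the numbers are a floor, not a census.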

  8. Use for impact reporting (optional)

    Now you can answer questions like: 'Across all projects, 450 people reported improved wellbeing (measured by survey), 78 gained employment, 120 improved housing situation.' Back this up with the narrative stories from your original reports for compelling impact reporting.
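The aggregated totals can be turned into report-ready sentences with a few lines; the labels and figures below mirror the example above and are illustrative:

```python
# Illustrative totals and phrasing for an annual-report summary sentence
totals = {"wellbeing": 450, "employment": 78, "housing": 120}
labels = {
    "wellbeing": "reported improved wellbeing",
    "employment": "gained employment",
    "housing": "improved their housing situation",
}

summary = "; ".join(
    f"{n} people {labels[outcome]}" for outcome, n in totals.items()
)
print(f"Across all projects, {summary}.")
```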

Example code

Extract outcomes from narrative reports

This processes narrative reports and extracts structured outcome data. Adapt the outcomes framework to match what you track.

from openai import OpenAI
import pandas as pd
import json
from pathlib import Path

client = OpenAI()

# Your outcomes framework - adapt this to your organisation
OUTCOMES_FRAMEWORK = {
    "wellbeing": "Improved wellbeing, mental health, or life satisfaction",
    "employment": "Gained employment, job, or paid work",
    "skills": "Developed new skills, qualifications, or capabilities",
    "housing": "Improved housing situation or accommodation",
    "social_connections": "Reduced isolation or increased social connections",
    "confidence": "Increased confidence, self-esteem, or agency",
    "health": "Improved physical health or access to healthcare"
}

def extract_outcomes_from_report(report_text, report_name):
    """Extract structured outcome data from a narrative report"""

    prompt = f"""You are analyzing a project report to extract outcome data.

OUTCOMES FRAMEWORK (only extract these):
{json.dumps(OUTCOMES_FRAMEWORK, indent=2)}

REPORT TEXT:
{report_text}

Extract all outcomes mentioned that match the framework above. For each outcome found, return:
- outcome_type: which outcome from the framework (use the key name)
- number_of_people: how many people achieved this outcome (extract the number if stated, or "not specified")
- evidence_method: how was this measured? (e.g., "survey", "interviews", "observation", "self-reported")
- quote_from_report: the exact text where this outcome was mentioned
- confidence: your confidence this outcome was achieved (high/medium/low based on evidence quality)

IMPORTANT RULES:
- Only extract outcomes explicitly stated in the report
- Don't infer outcomes not mentioned
- If a number isn't given, note "not specified" not a guess
- Distinguish between "all participants" vs "some" vs a specific number
- If evidence method isn't mentioned, note "not stated"

Return a JSON object with an "outcomes" key containing an array of outcomes found. If no outcomes match the framework, return {{"outcomes": []}}.

Example format:
{{
  "outcomes": [
    {{
      "outcome_type": "wellbeing",
      "number_of_people": "45",
      "evidence_method": "pre/post survey using WEMWBS",
      "quote_from_report": "45 participants (78%) showed improved wellbeing scores",
      "confidence": "high"
    }}
  ]
}}"""

    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
        response_format={"type": "json_object"}
    )

    result = json.loads(response.choices[0].message.content)

    # Add the report name to each outcome; default to an empty list
    # if the model returned no matching outcomes
    outcomes = result.get('outcomes', [])
    for outcome in outcomes:
        outcome['report_name'] = report_name
    return outcomes

# Process all reports in a folder
reports_folder = Path("project_reports")
all_outcomes = []

print("Extracting outcomes from reports...")

for report_file in reports_folder.glob("*.txt"):
    print(f"\nProcessing: {report_file.name}")

    with open(report_file, 'r', encoding='utf-8') as f:
        report_text = f.read()

    try:
        outcomes = extract_outcomes_from_report(report_text, report_file.stem)

        if outcomes:
            print(f"  Found {len(outcomes)} outcomes")
            all_outcomes.extend(outcomes)
        else:
            print(f"  No outcomes matching framework found")

    except Exception as e:
        print(f"  Error processing: {e}")

# Save to CSV
if all_outcomes:
    df = pd.DataFrame(all_outcomes)
    df.to_csv('extracted_outcomes.csv', index=False)

    print(f"\n{'='*60}")
    print(f"Extraction complete! Found {len(all_outcomes)} outcomes across {len(df['report_name'].unique())} reports")
    print(f"\nOutcome breakdown:")
    print(df['outcome_type'].value_counts())

    print(f"\nConfidence distribution:")
    print(df['confidence'].value_counts())

    print(f"\nSaved to extracted_outcomes.csv")
    print(f"\nNext steps:")
    print("1. Spot-check extractions against original reports for accuracy")
    print("2. Review low-confidence outcomes")
    print("3. Standardize any naming variations")
    print("4. Aggregate numbers (accounting for 'not specified' entries)")
    print("5. Cross-reference with project records for validation")

else:
    print("\nNo outcomes extracted. Check:")
    print("- Are your reports mentioning outcomes from the framework?")
    print("- Is the outcomes framework specific enough?")
    print("- Are report files in the correct folder and format?")

Tools

  • Claude (service, freemium)
  • OpenAI API (service, paid)
  • Google Colab (platform, freemium)

At a glance

  • Time to implement: days
  • Setup cost: low
  • Ongoing cost: low
  • Cost trend: decreasing
  • Organisation size: medium, large
  • Target audience: data-analyst, ceo-trustees, program-delivery

API costs are roughly £0.01-0.05 per report depending on length, so 50 reports cost around £0.50-2.50. Use paid API tiers when processing reports that contain sensitive beneficiary data - free tiers may use your inputs for model training. The main cost is staff time: defining your outcomes framework and validating the results.