AlgoFairness Pornometrics: Fair ML for User-Generated Adult Video Platforms
A complete, reproducible research pipeline for auditing algorithmic bias with intersectional analysis
⚠️ Research Content Note
This research analyzes adult content metadata to study algorithmic bias. No explicit imagery is displayed. All analysis follows strict ethical guidelines and focuses on fairness metrics, not the content itself.
Why This Research Matters
Content moderation algorithms systematically discriminate against LGBT creators, sex workers, and BIPOC communities. When your ML model decides what's "appropriate," it's encoding cultural biases at industrial scale, and the people getting hurt are always the same: queer folks, Black and Brown creators, anyone outside the cishet white norm.
This thesis proves that this discrimination is measurable, quantifiable, and, most importantly, fixable. Through a novel Ensemble Fairness Optimization approach, I reduced the documented bias while preserving predictive accuracy.
The Research in Numbers
- Black Women representation: 0.8%
- Best model accuracy (BERT): 92.6%
- Baseline gap (Black Women): -12.6%
Key Findings: Bias is Measurable
High overall model accuracy masks significant underlying biases. The Random Forest model's F1-score for Asian Women (0.119) is less than a quarter of its score for White Women (0.583), a severe performance disparity. The table below breaks the results down by model and intersectional group; a sketch of how such per-group metrics can be computed follows it.
| Model         | Group       | Accuracy | F1-Score |
|---------------|-------------|----------|----------|
| Random Forest | Asian Women | 79.5%    | 0.119    |
| Random Forest | Black Women | 83.3%    | 0.494    |
| Random Forest | White Women | 70.7%    | 0.583    |
| BERT          | Black Women | 95.6%    | 0.904    |
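As one illustration of how per-group numbers like these can be produced, here is a minimal sketch using Fairlearn's MetricFrame. The dataset, model, and variable names are synthetic placeholders, not the thesis code.

```python
# Sketch: per-group accuracy and F1 with Fairlearn's MetricFrame.
# Data and model are synthetic placeholders, not the thesis dataset.
import numpy as np
import pandas as pd
from fairlearn.metrics import MetricFrame
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
y = (X[:, 0] + rng.normal(scale=0.5, size=1000) > 0).astype(int)
groups = pd.Series(rng.choice(["Asian Women", "Black Women", "White Women"], size=1000))

# Trained and evaluated on the same synthetic data purely for brevity.
clf = RandomForestClassifier(random_state=0).fit(X, y)
y_pred = clf.predict(X)

mf = MetricFrame(
    metrics={"accuracy": accuracy_score, "f1": f1_score},
    y_true=y,
    y_pred=y_pred,
    sensitive_features=groups,
)
print(mf.overall)       # aggregate metrics that can mask disparities
print(mf.by_group)      # per-group accuracy and F1, as in the table above
print(mf.difference())  # largest between-group gap per metric
```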
Solutions: Bias Can Be Reduced
I tested three bias mitigation strategies: pre-processing (Reweighing), in-processing (Exponentiated Gradient with Demographic Parity), and post-processing (Calibrated Equalized Odds). The best approach reduced the accuracy gap for Black Women from -12.6% to -4.6%, a 64% improvement. A hedged sketch of the in-processing setup follows the summary below.
- Reweighed RF: best accuracy
- In-Processing (EG + DP): -4.6% gap (64% improvement) ✨
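For the in-processing route, here is a minimal sketch of Fairlearn's Exponentiated Gradient reduction under a Demographic Parity constraint. The data is synthetic, and the base estimator and setup are illustrative assumptions, not the configuration used in the thesis.

```python
# Sketch: in-processing mitigation with Fairlearn's Exponentiated Gradient
# reduction under a Demographic Parity constraint. Synthetic data only.
import numpy as np
import pandas as pd
from fairlearn.reductions import ExponentiatedGradient, DemographicParity
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 5))
groups = pd.Series(rng.choice(["Black Women", "White Women"], size=1000))
# Label correlated with group membership to induce a measurable gap.
y = ((X[:, 0] + 0.5 * (groups == "White Women")) > 0.25).astype(int)

base = LogisticRegression(max_iter=1000)
mitigator = ExponentiatedGradient(base, constraints=DemographicParity())
mitigator.fit(X, y, sensitive_features=groups)

y_pred = pd.Series(mitigator.predict(X))
# Selection rates should be much closer across groups after mitigation.
print(y_pred.groupby(groups.values).mean())
```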
30-Step Reproducible Pipeline
This isn't just a paper: it's a complete, reproducible research framework with 30 automated steps from raw data to final analysis, including interactive dashboards and causal inference. The phases are listed below, followed by a sketch of one way such a step runner could be driven.
- Steps 1-6: Data & Bias Discovery
- Steps 7-13: Modeling & Fairness
- Steps 15-23: Advanced Analysis
- Steps 24-30: Synthesis & Outputs
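The repository's actual entry point is not shown here; purely as an illustration of how a numbered-step pipeline like this is often driven, the sketch below runs hypothetical `step_XX_*.py` scripts in order. The `steps/` directory and naming scheme are assumptions, not the real layout.

```python
# Hypothetical driver for a numbered-step pipeline. The "steps/" directory
# and "step_XX_*.py" naming are illustrative assumptions, not the
# repository's actual structure.
import subprocess
import sys
from pathlib import Path

def run_pipeline(steps_dir: str = "steps", start: int = 1, stop: int = 30) -> None:
    """Run step scripts in numeric order, stopping at the first failure."""
    scripts = sorted(Path(steps_dir).glob("step_*.py"))
    for script in scripts:
        number = int(script.stem.split("_")[1])  # e.g. "step_07_modeling" -> 7
        if start <= number <= stop:
            print(f"Running {script.name} ...")
            subprocess.run([sys.executable, str(script)], check=True)

if __name__ == "__main__":
    run_pipeline()
```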
Tech Stack
- Python 3.12+
- scikit-learn
- Fairlearn
- PyTorch
- BERT
- NetworkX
- Pandas
- Statsmodels
- Matplotlib/Seaborn