STEM document accessibility, EquitableDocs

01 Why STEM content needs a dedicated pipeline

Standard auto-tagging tools fail on STEM content.

Equations become unreadable strings. Complex tables lose their header associations. Multi-column layouts break reading order. Students using screen readers hear noise instead of mathematics.

i

Equations

A screen reader cannot interpret an image of an equation. Every equation needs three things: plain-language alt text, MathML markup for braille displays, and the original LaTeX source for re-use.

ii

Complex tables

STEM tables often have multi-level headers, merged cells, and spanning rows. Without correct header-cell associations, a screen reader cannot tell a student which column or row a value belongs to.

iii

Multi-column layouts

Research papers, lab manuals, and many textbooks use two-column layouts. Auto-taggers frequently read across columns instead of down them, producing garbled output for screen reader users.

02 The eight-stage pipeline

Eight stages from intake to delivery.

Every STEM document passes through all eight stages. Automation handles bulk processing. Trained STEM specialists handle what automation cannot.

Status: The STEM pipeline is in active development. Early university partner enquiries are open now; full pipeline availability is planned for later in 2026. Get in touch to discuss your institution's needs.

01
Intake and classification

The document enters the queue. The system detects STEM content via keyword and structure analysis and routes it to the STEM pipeline.

AI classifierFastAPI
02
OCR pre-processing

Scanned documents are processed with OCR. Complex layouts use a fallback engine for better accuracy on multi-column and table-heavy pages.

OCRmyPDFTesseract 5PaddleOCR
03
Equation extraction

All equations are identified and extracted. Each equation is converted to LaTeX and MathML. Display and inline equations are flagged separately.

Mathpix OCRLaTeXMathML
04
Auto-tagging

The document is auto-tagged with structural markup: headings, paragraphs, lists, tables, and figures. STEM-specific rules handle equation placement and reading order.

Auto-tag engineAI alt-text
05
Specialist task assignment

The system generates a work order and assigns tasks to trained STEM specialists: equation alt text, table markup, complex figure descriptions, and reading-order review.

Task queueSpecialist routing
06
Equation alt-text writing

A trained STEM specialist writes a plain-language description of every equation. This is what a screen reader will announce. MathML is attached for braille display rendering.

Human-writtenMathML
07
Validation

Full PDF/UA compliance check. Screen-reader spot-test by an experienced specialist. Cross-validation for university deliverables.

veraPDFNVDAPAC
08
Delivery

The remediated document is delivered via the portal, email, or WhatsApp. A compliance certificate is attached for university deliverables. EPUB and tactile diagram formats are available on request.

DownloadEmailWhatsAppEPUBTactile

03 Every equation gets three forms of accessibility

Plain language, MathML, LaTeX.

Mathematical meaning is too easy to misrepresent with automation alone. The plain-language description is always written by a trained STEM specialist.

i

Plain-language alt text

Written by a trained STEM specialist. Describes what the equation says in natural language. This is what screen readers announce.

Example: "The quadratic formula: x equals negative b plus or minus the square root of b squared minus 4ac, all divided by 2a."

ii

MathML markup

Generated from the LaTeX output and embedded in the PDF. Allows advanced screen readers and braille displays to render the equation natively in mathematical notation.

iii

LaTeX source

Stored in document metadata and in the EquitableDocs database. Available for institutions that want to re-use equations in other formats, LMS platforms, or accessible EPUB exports.

04 STEM tables need precise markup

Three rules for three table types.

A screen-reader user navigates tables cell by cell. Without correct header associations, the data is meaningless. Different rules apply to different table types.

i

Value tables

Trigonometric values, physical constants, periodic table sections. Row 1 headers marked with column scope. Column 1 headers marked with row scope. All blank cells tagged as empty.

Rule: TH Scope=Col and TH Scope=Row.

ii

Multi-level header tables

Common in research papers and lab reports. Each header span gets correct ColSpan or RowSpan attributes. Complex associations are documented in the work order for the reviewing specialist.

Rule: ColSpan and RowSpan attributes.

iii

Layout tables

Answer grids, exercise boxes, and visual arrangements that look like tables but contain no data relationships. These are marked as artifacts or restructured as lists. Never tagged as data tables.

Rule: Artifact or restructured as list.

05 For universities with STEM content

The pipeline is available to university partners.

What we handle: Mathematics, physics, chemistry, engineering, economics, and biology textbooks. Lab manuals. Research papers. Exam papers (UPSC, state boards, JEE, NEET). Any document with equations, complex tables, or multi-column layouts.

What universities get: PDF/UA compliant output with MathML equations. EPUB export and tactile diagram formats for braille-ready delivery. Dedicated specialist assignment with STEM expertise. MathML delivery option for LMS integration (Moodle, Canvas, Blackboard). Annual STEM accessibility training included for partner institutions.

06 Discuss your institution's STEM needs

Get in touch.

We are working with early university partners to shape the STEM pipeline. Tell us about your content and we will tell you exactly how we can help.

Start a conversation Email us directly

Equations, tables, and complex layouts, made accessible.

Standard auto-tagging tools fail on STEM content.

Equations

Complex tables

Multi-column layouts

Eight stages from intake to delivery.

Intake and classification

OCR pre-processing

Equation extraction

Auto-tagging

Specialist task assignment

Equation alt-text writing

Validation

Delivery