Objectives

  1. Create a Spanish Grammatical Error Correction system that takes in text, and outputs grammatically corrected text

    Yo no persona famosoYo no soy una persona famosa

  2. [Stretch] Classify grammatical errors as output spans and tags

    (2,2) Missing:SUBJ; (3,4) Replace:GENDER

Datasets

COWS-L2H

Unlabelled Datasets

Evaluation Metrics

ERRANT

(Paper; GitHub)

  1. Performs alignment using a raw sentence and a corrected sentence (either reference or hypothesis)

    Screen Shot 2022-04-08 at 11.47.18 AM.png

  2. By computing alignment between the raw sentence and both the expert-corrected and the model-corrected sentences, we can compute an $F_{0.5}$

    $$ F_\beta = (1 + \beta^2)\frac {P*R} {(\beta^2 P) + R} $$

    $$ P = \frac {TP} {TP + FP}; R = \frac {TP} {TP + FN} $$