Volume 13, Issue 11 • December 9, 2025

Token-Level Language Model Baseline for Gregg Shorthand Symbols

Open access • Peer reviewed • CC BY-NC-SA 4.0

Thomas Jackson (Author) ORCID

Machine LearningLanguage ModelingShorthand

Abstract

This work presents a symbol-sequence language model baseline to support shorthand transcription correction. The model reduced character-sequence perplexity and improved post-recognition consistency on held-out notes.

Citation

Thomas Jackson (2025). Token-Level Language Model Baseline for Gregg Shorthand Symbols. Journal of Young Scientists & Engineers, 13(11). DOI pending assignment.

Identifiers

DOI pending assignment Zenodo

Access

This article is available with open access and permanent identifier links for citation and discovery.