UKTA Web

Unified Korean Text Analyzer

ACM/SIGAPP SAC 2025 AIED accepted paper (Oral) Paper Arxiv

Morpheme Analysis

Objective

Accurate segmentation of Korean morphemes
Challenging due to agglutinative nature (frequent morphological changes)
Errors propagate and negatively affect higher-level analyses

Approach

Utilize a state-of-the-art Korean morpheme analyzer
Minimize errors in morpheme analysis
Morpheme analyzer: Bareun
Morpheme analyzer used for vocabulary grading: UTagger

Mid-Level Analysis

Objective

Extract diverse linguistic features from morpheme level to sentence, paragraph level features

Approach

Over 294 numerical features, categorized as
Basic features: morpheme counts, density, lengths
Lexical diversity:
- Type-Token Ratios (TTR, RTTR, CTTR)
- MSTTR, MTLD, HD-D, VocD
Cohesion features: semantic similarity, topic consistency, etc.

Writing Evaluation

Objective

Produce explainable, rubric-based writing scores

Approach

Predict 10 rubric scores per essay using attention-based deep learning model

N	Type	Rubric
1	표현 (Expression)	문법 (Grammar)
2		어휘 (Vocabulary)
3		문장 표현 (Sentence Expression)
4	구조 (Organization)	문단 내 구조 (In-paragraph Structure)
5		문단 간 구조 (Inter-paragraph Structure)
6		구조적 일관성 (Structural Consistency)
7		길이 (Length)
8	내용 (Content)	주제 명확성 (Topic Clarity)
9		독창성 (Originality)
10		서사 (Narrative)

Combines
- Sentence-level representations (contextual meaning via pre-trained LM + BiGRU)
- Essay-level features (lexical and cohesion metrics)
Explainability through attention
Identifies which essay-level features most influence final scores
Provides transparency and reliability to users

Name		Name	Last commit message	Last commit date
Latest commit History 291 Commits
backend		backend
bareun		bareun
db		db
frontend		frontend
.gitignore		.gitignore
LICENSE		LICENSE
LICENSE_Bareun_BSD		LICENSE_Bareun_BSD
NOTICE		NOTICE
README.md		README.md
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

UKTA Web

Unified Korean Text Analyzer

Morpheme Analysis

Objective

Approach

Mid-Level Analysis

Objective

Approach

Writing Evaluation

Objective

Approach

About

Licenses found

Contributors 2

Languages

License

Licenses found

ttytu/UKTA-web

Folders and files

Latest commit

History

Repository files navigation

UKTA Web

Unified Korean Text Analyzer

Morpheme Analysis

Objective

Approach

Mid-Level Analysis

Objective

Approach

Writing Evaluation

Objective

Approach

About

Topics

Resources

License

Licenses found

Stars

Watchers

Forks

Contributors 2

Languages