-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathREADME.html
64 lines (53 loc) · 1.86 KB
/
README.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
<!DOCTYPE html>
<html>
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>Amharic NLP Tools in Java</title>
<link rel="stylesheet" href="assets/bootstrap/css/bootstrap.min.css">
</head>
<body>
<header>
<nav class="navbar navbar-default">
<div class="container">
<div class="navbar-header">
<center>
<a class="navbar-brand navbar-link" href="#">
<span class="glyphicon glyphicon-text-size"></span>
<span class="text-title">Amharic NLP Tools in Java </span>
<p>
This repository contains implementations of various Natural Language Processing (NLP) tasks and tools specifically for the Amharic language using Java. The goal is to provide a comprehensive set of tools to facilitate NLP research and development for Amharic.
</p>
</a>
</center>
</div>
</div>
</nav>
</header>
<div class="container">
<p>
# Amharic-NLP-Tools-in-JAVA
## Amharic Text Preprocessing:
- Tokenization
- Sentence Segmentation
- Normalization
--Character normalization
--Abbreviation substitution
--Strange char, word, symbole remove
-- Removal of emojis
-- Removal of emoticons
-- Removal of Punctuations
-- Conversion of emoticons to words
-- Conversion of emojis to words
- StopWord Removal
- Lemmatization
- Stemming
- Named Entity Recognition
</p>
</div>
<footer>
<h5>Intel 8086 Tutorial by Bushra KM © 2022; bushra.kmb@gmail.com</h5></footer>
<script src="assets/js/jquery.min.js"></script>
<script src="assets/bootstrap/js/bootstrap.min.js"></script>
</body>
</html>