DocTr++ paper reproduction #11475

chenjjcccc · 2024-01-08T09:20:14Z

DocTr++ paper reproduction

paddle-bot · 2024-01-08T09:20:19Z

Thanks for your contribution!

CLAassistant · 2024-01-08T10:07:18Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
0 out of 2 committers have signed the CLA.

❌ chenjiajun05
❌ paddle-models

chenjiajun05 seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

tink2123

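Thank you very much for your contribution. Please submit the trained parameters and the inference model so that we can verify them. Also, please integrate the code into the PaddleOCR framework. For reference, see: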
https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.7/applications/%E9%AB%98%E7%B2%BE%E5%BA%A6%E4%B8%AD%E6%96%87%E8%AF%86%E5%88%AB%E6%A8%A1%E5%9E%8B.md

tink2123 · 2024-01-12T07:38:03Z

applications/Doctr++文档矫正/doc3d_dataset.py

+        image_path = os.path.join(self.root, "img", image_name + ".png")
+        image = cv2.imread(image_path)
+
+        # if image is None:


Please remove the redundant code.

tink2123 · 2024-01-12T07:39:21Z

applications/Doctr++文档矫正/load_dataset.sh

+    echo " # done"
+}
+
+doc3d_download "http://vision.cs.stonybrook.edu/~sagnik/doc3d/img_1.zip" "$outputPath/" "img_1.zip"


Consider writing a loop, with the file names kept in a list.
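A minimal sketch of the loop suggested above. The real `doc3d_download` is defined in load_dataset.sh and is not shown in this thread, so a stand-in that only prints is used here. Only `img_1.zip` appears in the diff; the other archive names in the list are placeholders, and the default value of `outputPath` is an assumption.

```shell
#!/usr/bin/env bash
# Stand-in for the real doc3d_download from load_dataset.sh (not shown in the diff).
# It only prints what it would fetch, so the sketch is safe to run.
doc3d_download() {
    local url="$1" out_dir="$2" file_name="$3"
    echo "download $url -> ${out_dir}${file_name}"
}

base_url="http://vision.cs.stonybrook.edu/~sagnik/doc3d"
# Assumed default; the original script sets outputPath elsewhere.
outputPath="${outputPath:-./doc3d}"

# Hypothetical list: only img_1.zip is confirmed by the diff.
files=("img_1.zip" "img_2.zip" "img_3.zip")

# One call per archive instead of one hard-coded line per archive.
for file_name in "${files[@]}"; do
    doc3d_download "$base_url/$file_name" "$outputPath/" "$file_name"
done
```

Adding or removing an archive then means editing the `files` array only.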

tink2123 · 2024-01-12T07:40:34Z

applications/Doctr++文档矫正/train.sh

+export FLAGS_logtostderr=0
+export CUDA_VISIBLE_DEVICES=4
+
+python train.py --data-root /ssd1/chenjiajun05/chenjiajun05/doc3d \


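Do not include personal directories. I suggest deleting the sh script and describing how to launch training in the documentation instead.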

tink2123 · 2024-01-12T07:43:13Z

applications/Doctr++文档矫正/Doctr++_plus.md

+./train.sh
+```
+After each epoch, the validation set is evaluated and best_model and last_model are saved.
+## 7. Model evaluation


Please provide the evaluation accuracy metrics.

tink2123 · 2024-01-12T07:43:17Z

applications/Doctr++文档矫正/Doctr++_plus.md

+chmod +x train.sh
+./train.sh
+```
+After each epoch, the validation set is evaluated and best_model and last_model are saved.


tink2123 · 2024-01-12T07:44:33Z

applications/Doctr++文档矫正/Doctr++_plus.md

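+Once the preparation is complete, model training can begin. This project provides the training script train.sh; you need to set the dataset path --data-root yourself.
+```python
+# Model training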
+chmod +x train.sh


Replace the script with the full launch command. For reference:

python3 tools/train.py -c configs/rec/PP-OCRv3/en_PP-OCRv3_rec.yml -o Global.checkpoints=./your/trained/model

tink2123 · 2024-01-12T08:00:00Z

applications/Doctr++文档矫正/Doctr++_plus.md

+python eval_DocUNet.py --i ./crop/ --m pretrained_model  --o output_val --g ./scan/
+```
+The results are shown below:
+


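Please add inference and deployment instructions, including a single-image inference script, the inference model export command, and a prediction script based on the inference model. For reference: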
https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.7/doc/doc_ch/algorithm_rec_nrtr.md#4-%E6%8E%A8%E7%90%86%E9%83%A8%E7%BD%B2

nissansz · 2024-03-27T10:00:17Z

Is there a trained model for this? How do I use it?

chenjjcccc changed the title ~~My app doc~~ Submitted the PR by mistake Jan 8, 2024

chenjjcccc changed the title ~~Submitted the PR by mistake~~ Submitted the PR by mistake, please delete Jan 8, 2024

the first commit

1de41f6

chenjjcccc changed the base branch from release/2.7 to dygraph January 8, 2024 10:08

chenjjcccc changed the title ~~Submitted the PR by mistake, please delete~~ DocTr++ paper reproduction Jan 8, 2024

tink2123 reviewed Jan 12, 2024

paddle-models added 2 commits January 26, 2024 17:11

update predict

ad85717

update

9470433

GreatV mentioned this pull request Jan 30, 2024

Layout rectification network DocTr++ paper reproduction #10379

Closed

Sunting78 mentioned this pull request Mar 27, 2024

How to train docunet with paddle? #11817

Closed

jzhang533 deleted the branch PaddlePaddle:dygraph April 22, 2024 03:26

jzhang533 closed this Apr 22, 2024

github-actions bot locked as resolved and limited conversation to collaborators Nov 11, 2024

paddle-bot bot added the contributor label Nov 13, 2024
