the fine-tuning of language in the content section #21

num3num · 2024-07-18T10:52:39Z

UniTable is a powerful recognition tool, but I want to train the table content recognition to support other languages. Do you have any suggestions or opinions?

ShengYun-Peng · 2024-07-22T14:20:09Z

I would suggest fine-tuning the OCR branch on the target language, and UniTable should work out of the box.

num3num · 2024-07-29T07:43:59Z

In the bbox recognition stage, a single bbox may contain a large amount of text or gaps, which can lead to content loss or misalignment. Do you have any suggestions for this situation? Also, what model or debugging method is used for pre-training or fine-tuning unitable_large_bbox.pt?
