Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Table cell content recognition module generates garbage result on unseen data #28

Open
therajmaurya opened this issue Aug 31, 2024 · 0 comments

Comments

@therajmaurya
Copy link

On unseen table images, Table structure extraction and Table cell bbox detection work very well but the Table cell content recognition generates garbage. Replacing it with any open-source OCR model for extracting cell content works better than the given model. There seems to be an issue with the Table cell content recognition model that does not generalize well.

P.S. When I run the Table cell content recognition on the dataset on which it was trained/evaluated, it works extremely well, but fails to generalise on new unseen table images.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant