New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

add PP-ChatOCRv4 and support PDF #2734

Closed

dyning wants to merge 0 commits into PaddlePaddle:develop from dyning:develop

+0 −0

Collaborator

dyning commented Dec 26, 2024

No description provided.

paddle-bot bot commented Dec 26, 2024

Thanks for your contribution!

TingquanGao reviewed

View reviewed changes

api_examples/pipelines/test_pp_chatocrv4.py Outdated

+                  use_table_recognition=True,
+              )
+              # ####[TODO] 增加类别信息

Collaborator

TingquanGao Dec 27, 2024

这里是后续再改么？

Collaborator Author

dyning Dec 27, 2024

应该不会改了

paddlex/inference/common/result/base_cv_result.py Outdated

Collaborator

TingquanGao Dec 27, 2024

TODO: 其他CV模块需要同步修改。

paddlex/inference/pipelines_new/layout_parsing/pipeline.py Outdated

+                      Returns:
+                          OCRResult: The predicted OCR result with updated dt_boxes.
+                      """
+                      overall_ocr_res = next(self.general_ocr_pipeline(image_array))

Collaborator

TingquanGao Dec 27, 2024

这里及之前用next的原因是因为batch_size为1么？

paddlex/inference/pipelines_new/layout_parsing/result.py Outdated


		class TableRecognitionResult(CVResult, HtmlMixin, XlsxMixin):
		class TableRecognitionResult(BaseCVResult, HtmlMixin, XlsxMixin):

Collaborator

TingquanGao Dec 27, 2024

是否应该将HtmlMixin和XlsxMixin合并为TableMixin，TableMixin提供html、xlsx相关方法。

paddlex/inference/pipelines_new/pp_chatocr/pipeline.py Outdated

+              class PP_ChatOCR_Pipeline(BasePipeline):
+                  """PP-ChatOCR Pipeline"""
+                  entities = ["PP-ChatOCRv3-doc", "PP-ChatOCRv4-doc"]

Collaborator

TingquanGao Dec 27, 2024

后续ChatOCR迭代的话，仍然会沿用这一逻辑吗，如果后续版本有大的变更，那是否应该分开写？

Collaborator Author

dyning Dec 27, 2024

目前看大的逻辑不会变了

paddlex/inference/pipelines_new/pp_chatocr/pipeline.py Outdated

-                          layout_parsing_config = config["SubPipelines"]["LayoutParser"]
-                          self.layout_parsing_pipeline = self.create_pipeline(layout_parsing_config)
+                      layout_parsing_config = config["SubPipelines"]["LayoutParser"]
+                      self.layout_parsing_pipeline = self.create_pipeline(layout_parsing_config)
                       from .. import create_chat_bot

Collaborator

TingquanGao Dec 27, 2024

这里是否要移到上面？

Collaborator Author

dyning Dec 27, 2024

移到上面会循环应用

paddlex/inference/pipelines_new/pp_chatocr/pipeline.py Outdated

+                          all_table_text_list,
+                          all_table_html_list,
+                          all_table_nei_text_list,
+                      ) = all_visual_info
                       final_results = {}
                       failed_results = ["大模型调用失败", "未知", "未找到关键信息", "None", ""]

Collaborator

TingquanGao Dec 27, 2024

看上面的代码，failed_results应该是dict？

Collaborator Author

dyning Dec 27, 2024

修改了

paddlex/inference/pipelines_new/pp_chatocr/pipeline.py Outdated

Collaborator

TingquanGao Dec 27, 2024

def generate_and_merge_chat_results(self, prompt: str, key_list: list, final_results: dict, failed_results: dict)中，failed_results好像是list

Collaborator Author

dyning Dec 27, 2024

修改了

paddlex/inference/pipelines_new/pp_chatocr/pipeline.py Outdated

Collaborator

TingquanGao Dec 27, 2024

# print(prompt, llm_result)这里可以删掉
https://github.com/PaddlePaddle/PaddleX/pull/2734/files#diff-6a1b32b7567e8a063375343e919c35d78242c535f36a551f01c4e9f67df77f1aR462

TingquanGao reviewed

View reviewed changes

paddlex/inference/common/result/base_cv_result.py Outdated

-                      assert (
-                          BaseCVResult.INPUT_IMG_KEY in data
-                      ), f"`{BaseCVResult.INPUT_IMG_KEY}` is needed, but not found in `{list(data.keys())}`!"
-                      self._input_img = data.pop("input_img", None)

Collaborator

TingquanGao Dec 27, 2024 •

edited

Loading

这里建议先不删除吧，我后续提PR删除，因为要配合CV其他模块一起改，其他模块都是依赖self._input_img的，会导致CI挂。

dyning closed this

dyning force-pushed the develop branch from 44182fc to ec942b9 Compare

December 31, 2024 13:54

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet