Bulk bring your own data #683
Comments
Hi @vicentegarciadiez, great question. There is a tool for importing outside of the application, but we've recently discovered that it doesn't function outside of the dev environment: https://github.com/microsoft/chat-copilot/tree/main/tools/importdocument The kernel-memory repo has all the machinery in place for you to take matters into your own hands. All you need to do is post documents to the same queue and blob store that your chat-copilot is configured for. The most concise expression of what this might look like can be viewed at https://github.com/microsoft/kernel-memory/blob/main/service/Service/Program.cs (although you could run yours as a console application).
Thanks @crickman for your answer! I have one question, though: in your example, will mydocument.docx be available to all chats or only to a selected chat? Best regards.
Right...good point...I erroneously omitted those details. This would be a more complete expression (with some of the values expanded as literals):
The related code in chat-copilot is:
The code for accessing CosmosDB data is:
Thanks @crickman. Do you know how the images inside a document are indexed? I mean, is OCR applied to those images? Thanks in advance.
I do not believe images are processed using OCR for docx, pptx, xlsx, or pdf.
I have sometimes used external tools to convert documents with complex structure to text and then uploaded the text result. Azure Form Recognizer also has some options for more complex document parsing.
Hi team, is there any way to bring my own data into a chat in bulk?
I mean, I want to load lots of PDF files into a chat to ask questions about them, but there are many limitations, such as 10 files at a time or size limits.
Thanks in advance.
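While the in-app limits remain, one workaround is to split a large set of files into groups that each stay within them. The following is only a minimal sketch of that idea, not part of chat-copilot or kernel-memory: `batch_files` is a hypothetical helper, the 10-file default comes from the question above, and the 50 MB byte cap is an assumed placeholder, since the actual size limit is not stated in this thread.

```python
from typing import Iterable, Iterator, List, Tuple


def batch_files(
    files: Iterable[Tuple[str, int]],
    max_files: int = 10,            # per-upload file count mentioned in the question
    max_bytes: int = 50_000_000,    # ASSUMPTION: placeholder for the real size limit
) -> Iterator[List[str]]:
    """Group (path, size_in_bytes) pairs into batches that respect both limits."""
    batch: List[str] = []
    batch_bytes = 0
    for path, size in files:
        if size > max_bytes:
            # A single file over the cap can never fit in any batch.
            raise ValueError(f"{path} alone exceeds the size limit")
        # Close the current batch if adding this file would break either limit.
        if batch and (len(batch) >= max_files or batch_bytes + size > max_bytes):
            yield batch
            batch, batch_bytes = [], 0
        batch.append(path)
        batch_bytes += size
    if batch:
        yield batch
```

Each yielded batch could then be uploaded in a separate step; the sizes can be gathered beforehand with `os.path.getsize`.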