Earn 13,500 ($135.00)
Custom Document Management System (DMS) with ChatGPT, LlamaIndex and ChromaDB Integration
Bounty Description
Problem Description
We are seeking an experienced developer to build a custom Document Management System (DMS) that integrates ChatGPT, LlamaIndex, LangChain and ChromaDB. The DMS will serve as a personal information retrieval system that accepts, processes, and queries a variety of documents including contracts, government letters, and other official documents.
Acceptance Criteria
Uploading: User can upload scanned PDFs without error.
OCR Functionality: Uploaded PDFs are accurately OCR'd by Unstructured.io.
Storage: OCR'd text is embedded and stored in ChromaDB.
Querying: User can retrieve information via a Chainlit chat interface.
Citation: Query results include a citation and a link to the original document stored in ChromaDB.
Auto-fill: The AI suggests what to fill in a PDF document and provides a citation to the source in ChromaDB.
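The Storage and Citation criteria both depend on each stored chunk keeping track of where it came from. The following is a minimal sketch of one way to do that, not a required design. It assumes the OCR step yields text per page; the `PageText` class, the ID scheme, and the `source`/`page` metadata keys are hypothetical names chosen for illustration. It uses only the standard library, and the three returned lists are shaped to match the `ids`, `documents`, and `metadatas` arguments of ChromaDB's `collection.add`.

```python
from dataclasses import dataclass


@dataclass
class PageText:
    """OCR output for one page (hypothetical shape, not an Unstructured.io type)."""
    source_file: str
    page_number: int
    text: str


def chunk_text(text: str, max_chars: int = 500) -> list[str]:
    """Split text on whitespace into chunks of at most max_chars.

    A single word longer than max_chars is kept whole in its own chunk.
    """
    chunks, current = [], ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def build_records(pages: list[PageText], max_chars: int = 500):
    """Return (ids, documents, metadatas) with one entry per chunk."""
    ids, documents, metadatas = [], [], []
    for page in pages:
        for i, chunk in enumerate(chunk_text(page.text, max_chars)):
            # Assumed ID scheme: file name, page number, chunk index.
            ids.append(f"{page.source_file}:p{page.page_number}:c{i}")
            documents.append(chunk)
            metadatas.append(
                {"source": page.source_file, "page": page.page_number}
            )
    return ids, documents, metadatas


def format_citation(metadata: dict) -> str:
    """Render the metadata of a retrieved chunk as a short citation."""
    return f"[{metadata['source']}, page {metadata['page']}]"
```

With records built this way, the chat interface can call `format_citation` on the metadata returned with each retrieved chunk, and the auto-fill feature can attach the same citation to each suggested value.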
Technical Details
Languages: Python, SQL, possibly others
Libraries: OCR libraries (such as Unstructured.io), Chainlit, LlamaIndex, LangChain
Databases: ChromaDB for vector storage