Jyong 0737e930cb chore: remove Langchain tools import (#3407) 1 year ago
..
blod 0737e930cb chore: remove Langchain tools import (#3407) 1 year ago
entity 5b953c1ef2 Fix some RAG bugs (#2570) 1 year ago
unstructured b00466f025 feat:api Add support for extracting EPUB files in ExtractProcessor (#3254) 1 year ago
csv_extractor.py 6164604462 fix dataset retrival in dataset mode (#3334) 1 year ago
excel_extractor.py ad65c891e7 add xls file suport (#3321) 1 year ago
extract_processor.py ad65c891e7 add xls file suport (#3321) 1 year ago
extractor_base.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
helpers.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
html_extractor.py 5b953c1ef2 Fix some RAG bugs (#2570) 1 year ago
markdown_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
notion_extractor.py a4d86496e1 fix: notion extractor raise 'NoneType' object has no attribute 'curre… (#2608) 1 year ago
pdf_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
text_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 year ago
word_extractor.py b163545771 Use `python-docx` to extract docx files (#2654) 1 year ago