Bowen Liang ccb6ddd840 chore: bump Ruff to 0.5.7 (#7174) 9 月之前
..
blod f976740b57 improve: mordernizing validation by migrating pydantic from 1.x to 2.x (#4592) 11 月之前
entity 12c815c597 fix: ExtractSetting optional value missing None as default val (#5238) 11 月之前
firecrawl a9ee52f2d7 Fix/firecrawl parameters issue (#6213) 10 月之前
unstructured ccb6ddd840 chore: bump Ruff to 0.5.7 (#7174) 9 月之前
csv_extractor.py 58db719a2c dep: bump pandas from 1.x to 2.x (#4820) 11 月之前
excel_extractor.py cf258b7a67 add xlsx support hyperlink extract (#6722) 10 月之前
extract_processor.py 79cb23e8ac security/SSRF vulns (#6682) 10 月之前
extractor_base.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 年之前
helpers.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 年之前
html_extractor.py 5b953c1ef2 Fix some RAG bugs (#2570) 1 年之前
markdown_extractor.py 5e4ac11df3 fix: code block segmentation problem of markdown document (#6465) 10 月之前
notion_extractor.py c8f5dfcf17 refactor(rag): switch to dify_config. (#6410) 10 月之前
pdf_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 年之前
text_extractor.py 6c4e6bf1d6 Feat/dify rag (#2528) 1 年之前
word_extractor.py 12095f8cd6 extract docx filter comment element (#7092) 9 月之前