Preprocessing Unstructured Data for LLM Applications

https://learn.deeplearning.ai/courses/preprocessing-unstructured-data-for-llm-applications

Improve your RAG system to retrieve diverse data types

  • Learn to extract and normalize content from a wide variety of document types, such as PDF, PowerPoint, Word, and HTML files, as well as tables and images, to expand the information accessible to your LLM.

  • Enrich your content with metadata to improve retrieval-augmented generation (RAG) results and support more nuanced search capabilities.

  • Explore document image analysis techniques such as layout detection, vision transformers, and table transformers, and learn how to apply these methods to preprocess PDFs, images, and tables.

  • Level: Beginner

  • Instructor: Matt Robinson

  • Prerequisite recommendation: This is a beginner-friendly course.
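
To make "extract, normalize, and enrich with metadata" concrete, here is a minimal sketch that uses only the Python standard library. It is not the course's tooling. The category names, `TAG_CATEGORIES`, `ElementExtractor`, and `extract_elements` are hypothetical names chosen for illustration. It turns a small HTML snippet into a flat list of elements with whitespace-normalized text and a metadata dictionary that a retrieval step could filter on.

```python
from html.parser import HTMLParser

# Hypothetical mapping from HTML tags to normalized element categories.
TAG_CATEGORIES = {
    "h1": "Title", "h2": "Title", "h3": "Title",
    "p": "NarrativeText",
    "li": "ListItem",
}


class ElementExtractor(HTMLParser):
    """Collects text from selected tags as normalized elements with metadata."""

    def __init__(self, filename):
        super().__init__()
        self.filename = filename
        self.elements = []
        self._tag = None    # tag currently being captured, if any
        self._chunks = []   # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        # Start capturing only at the outermost tag of interest.
        if self._tag is None and tag in TAG_CATEGORIES:
            self._tag = tag
            self._chunks = []

    def handle_data(self, data):
        if self._tag is not None:
            self._chunks.append(data)

    def handle_endtag(self, tag):
        if tag != self._tag:
            return
        # Normalize: collapse runs of whitespace into single spaces.
        text = " ".join("".join(self._chunks).split())
        if text:
            self.elements.append({
                "category": TAG_CATEGORIES[tag],
                "text": text,
                # Metadata that a search or RAG filter could use later.
                "metadata": {
                    "filename": self.filename,
                    "tag": tag,
                    "index": len(self.elements),
                },
            })
        self._tag = None


def extract_elements(html, filename="example.html"):
    """Parse an HTML string and return its normalized elements."""
    parser = ElementExtractor(filename)
    parser.feed(html)
    parser.close()
    return parser.elements


sample = """
<h1>Quarterly   Report</h1>
<p>Revenue grew in <b>all</b>
   regions.</p>
<ul><li>North</li><li>South</li></ul>
"""
elements = extract_elements(sample, filename="report.html")
titles = [e["text"] for e in elements if e["category"] == "Title"]
```

Because every element has the same shape regardless of the tag it came from, downstream code can filter on `category` or `metadata` without knowing the original file format. This sketch handles only a few HTML tags. PDFs, images, and tables need the layout-detection and transformer-based approaches named in the list above.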

Videos