Skip to content

Latest commit

 

History

History

PreprocessingUnstructuredDataforLLMApplications

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Preprocessing Unstructured Data for LLM Applications

https://learn.deeplearning.ai/courses/preprocessing-unstructured-data-for-llm-applications

Improve your RAG system to retrieve diverse data types

  • Learn to extract and normalize content from a wide variety of document types, such as PDFs, PowerPoints, Word, and HTML files, tables, and images to expand the information accessible to your LLM.

  • Enrich your content with metadata, enhancing retrieval augmented generation (RAG) results and supporting more nuanced search capabilities.

  • Explore document image analysis techniques like layout detection and vision and table transformers, and learn how to apply these methods to preprocess PDFs, images, and tables.

  • Beginner

  • Matt Robinson

  • Prerequisite recommendation: This is a beginner-friendly course.

Videos