Skip to content
This repository has been archived by the owner on Sep 12, 2024. It is now read-only.

minor update on web_page_reader #130

Merged
merged 2 commits into from
Nov 6, 2023
Merged
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion autollm/utils/web_page_reader.py
Original file line number Diff line number Diff line change
Expand Up @@ -68,7 +68,7 @@ def load_data(self, url: str) -> List[Document]:
tag.decompose()

content = " ".join(soup.stripped_strings)
document = Document(text=content, metadata={"url": url})
document = Document(doc_id=url, text=content, metadata={"url": url})
SeeknnDestroy marked this conversation as resolved.
Show resolved Hide resolved
return [document]


Expand Down