Input Variables
Console Log
You can copy and paste
You can also provide any
If the
I am using
YouTube
video URL
,
it will extract video_id
and use
YouTube Transcript API
to
get the video transcript in English. Support for other languages can be added, even on-the-fly
translation is possible. If the transcript is not available, it will show an error message. You can also provide any
Web URL
, it will extract the text
and summarize it using AsyncHtmlLoader
and Html2TextTransformer
PDF
URLs are now supported! It uses PyPDFLoader
and
some custom logic to reduce documents to fit in the LLM context window.
If the
text
to be summarized fits in the LLM context window
it is directly
summarized. If it is too long, it will be split into chunks, each chuck is summarized and then combined.
This approach is called Map Reduce
.I am using
🦜️🔗 LangChain
to
steer the process,
SocketIO
for streaming logs and LLM output
in realtime.