Main
1 Download a video essay's audio via yt-dlp:
yt-dlp -f m4a <video_url> -o output.m4aor extract via ffmpeg if download isn't necessary:
ffmpeg -i input.mp4 -vn -c:a copy output.m4a2 transcribe the audo file via OpenAI Whisper:
whisper --model tiny.en --device cuda -f txt output.m4aNote that this works only for English videos if you have a cuda-compatible GPU. If you don't have one then:
whisper --model tiny.en -f txt output.m4aAnd if it's a non-English video
whisper --model tiny --device cuda -f txt output.m4a3 Simply give the file to a LLM chatbot like ChatGPT and ask him to format the text. Prompt:
format this diabtre into a well-organised, concise article. Mention any details that could be considered 'absurd'.