Screenshot as an API
References
Screenshots as the Universal API
- With ML advancements, screenshots are now a universal data format.
- (decoder) relatively easy to extract...
- (encoder) diffusion-based text-to-image models like Stable Diffusion and DALL-E (see Prompt Engineering)
- What's good:
- Easier to parse than highly complex layout formats
- No need to understand PDF data format
- No need for web crawlers to hydrate web pages
- Universally available, easily copyable
- Permissionless.
- Many applications won't allow you to export data.
- Screenshots are always available.
- Related: Naver Vibe tried to take market share from other music players with its screenshot recognition technology.
- More complex metadata
- I wrote a reply like the following. Letter to Mr. Matt Rickard on 2022-10-03
Rethinking the PDF
- Its founder, John Warnock (co-founder of Adobe), prototyped a compatibility layer where documents would look and, most importantly, print (!) the same regardless of the computer they were viewed on (1993 video). This is the PDF.
- The "killer app" for PDF was tax returns - the IRS adopted PDF in 1996 because of a rumored frustration with the US Postal Service.
- What's lacking: