Internet Archive API

Base: /internet-archive/v1 (6 endpoints)

POST /internet-archive/v1/item (1 credit)

Full metadata for one archive.org item by identifier: title, creator, description, date, publisher, language, subjects, collections, license, rating — plus the COMPLETE file list (name, format, size, duration, source) with direct download URLs, and the item's reviews. Set include_files=false / include_reviews=false to slim the payload.

Parameter | Allowed / range | Description
identifier (required) | - | The archive.org item identifier (the slug in archive.org/details/<identifier>). From a search result's 'identifier'.
include_files = true (optional) | - | Include the full file list with download URLs (default true).
include_reviews = true (optional) | - | Include the item's reviews (default true).
files_limit = 200 (optional) | 1–2000 | Cap on files returned (1-2000, default 200); clamped.
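A minimal sketch of building a request for this endpoint with Python's standard library. Several things here are assumptions rather than documented facts: the host `https://api.example.com` is a placeholder, the JSON request body is a guess (the page only says the endpoint is a POST), and authentication is left out because it is not described here. The local clamp on `files_limit` only mirrors the documented 1-2000 range; the service applies its own.

```python
import json
import urllib.request

# Placeholder host: the page gives only the path prefix, not the server.
API_HOST = "https://api.example.com"
BASE_PATH = "/internet-archive/v1"


def build_item_request(identifier, include_files=True,
                       include_reviews=True, files_limit=200):
    """Build (but do not send) a POST request for the item endpoint."""
    if not identifier:
        raise ValueError("identifier is required")
    payload = {
        "identifier": identifier,
        "include_files": include_files,
        "include_reviews": include_reviews,
        # Mirror the documented 1-2000 range locally.
        "files_limit": max(1, min(2000, int(files_limit))),
    }
    return urllib.request.Request(
        API_HOST + BASE_PATH + "/item",
        data=json.dumps(payload).encode("utf-8"),  # assumed JSON body
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# "example_item" is a made-up identifier used only for illustration.
req = build_item_request("example_item", include_reviews=False, files_limit=5000)
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Once a real host and credentials are filled in, the request could be sent with `urllib.request.urlopen(req)`.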
POST /internet-archive/v1/collection (1 credit)

Browse the items inside an archive.org collection by its identifier (e.g. 'nasa', 'librivoxaudio', 'prelinger', 'GratefulDead'). Same rich item rows as search, paginated and sortable; optionally narrow by mediatype within the collection.

Parameter | Allowed / range | Description
collection (required) | - | The collection identifier to browse (from a search result's 'collections', or a known name).
mediatype (optional) | texts · audio · movies · software · image · etree · data · web · collection · account | Filter to one archive.org media bucket: texts, audio, movies, software, image, etree (concerts), data, web, collection, account.
query (optional) | - | Free-text search across titles, descriptions, creators and subjects. Accepts Lucene syntax (e.g. 'title:moon AND year:[1960 TO 1970]'). Optional, because the required collection already acts as a filter.
sort (optional) | downloads desc · downloads asc · week desc · publicdate desc · publicdate asc · date desc · date asc · avg_rating desc · titleSorter asc · titleSorter desc | Result ordering. Defaults to most-downloaded.
rows = 25 (optional) | 1–100 | Results per page (1-100, default 25); clamped.
page = 1 (optional) | 1–1000 | Page number (1-based). Keep requesting pages until meta.has_more is false.
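The paging rule (request pages until `meta.has_more` is false) can be sketched as a small generator. The HTTP call is injected as `fetch_page`, a hypothetical callable that takes the request payload and returns the decoded JSON response, so the loop itself makes no assumption about host or authentication. Only `meta.has_more` is taken from this page; the rest of the response shape is not documented here, so the generator yields whole pages rather than guessing a key for the item rows.

```python
def iter_collection_pages(fetch_page, collection, rows=25, max_pages=1000, **filters):
    """Yield response pages until meta.has_more is false.

    fetch_page: callable(payload: dict) -> dict (decoded JSON response).
    filters: optional mediatype, query, or sort values passed through as-is.
    """
    rows = max(1, min(100, int(rows)))  # documented range is 1-100
    for page in range(1, max_pages + 1):  # pages are 1-based, up to 1000
        payload = {"collection": collection, "rows": rows, "page": page, **filters}
        response = fetch_page(payload)
        yield response
        if not response.get("meta", {}).get("has_more", False):
            break


# Stand-in for the real HTTP call: pretends the collection spans three pages.
def fake_fetch(payload):
    return {"meta": {"has_more": payload["page"] < 3}, "page": payload["page"]}


pages = list(iter_collection_pages(fake_fetch, "nasa", mediatype="image",
                                   sort="downloads desc"))
print(len(pages))  # 3
```

Treating a missing `meta` block as "no more pages" is a defensive choice in this sketch, so an unexpected response ends the loop instead of requesting all 1000 pages.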
POST /internet-archive/v1/reviews (1 credit)

All public reviews and star ratings for one archive.org item: reviewer handle, star rating (0-5), review title, body and date. Reviews are embedded in the item record, so this returns every review in one call (no pagination needed).

Parameter | Allowed / range | Description
identifier (required) | - | The archive.org item identifier (the slug in archive.org/details/<identifier>). From a search result's 'identifier'.
POST /internet-archive/v1/files (1 credit)

Just the downloadable files for one archive.org item (no heavy metadata/reviews): name, format, size, duration, source and a direct download URL for each. Useful for fetching the actual book PDF / MP3 / video / ZIP.

Parameter | Allowed / range | Description
identifier (required) | - | The archive.org item identifier (the slug in archive.org/details/<identifier>). From a search result's 'identifier'.
format (optional) | - | Optional case-insensitive substring filter on the file format (e.g. 'PDF', 'MP3', 'MPEG4', 'EPUB').
files_limit = 500 (optional) | 1–5000 | Cap on files returned (1-5000, default 500); clamped.
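To make "case-insensitive substring filter" concrete, here is a small sketch that reproduces that matching rule locally and builds a request payload. The filtering itself happens on the server; `format_matches` only illustrates the semantics. The helper names and the sample format strings are illustrative assumptions, not values returned by this API.

```python
def format_matches(format_filter, file_format):
    """Case-insensitive substring match, as the format parameter is described."""
    return format_filter.lower() in file_format.lower()


def build_files_payload(identifier, format_filter=None, files_limit=500):
    """Build the request body for the files endpoint (hypothetical helper)."""
    payload = {
        "identifier": identifier,
        # Mirror the documented 1-5000 range locally.
        "files_limit": max(1, min(5000, int(files_limit))),
    }
    if format_filter:
        payload["format"] = format_filter
    return payload


# Illustrative format labels only.
sample_formats = ["Text PDF", "VBR MP3", "EPUB", "MPEG4"]
print([f for f in sample_formats if format_matches("pdf", f)])  # ['Text PDF']
print(build_files_payload("example_item", "pdf", files_limit=0))
```

Because the match is a substring test, a short filter such as 'MP' would select both the MP3 and MPEG4 entries above, so a more specific string narrows the result.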
POST /internet-archive/v1/search_inside (1 credit)

Full-text 'search inside' one digitized book on archive.org: find a phrase within the book's OCR text and get the matching snippets with page numbers. The item must be a texts/book item with OCR (most scanned books are).

Parameter | Allowed / range | Description
identifier (required) | - | The archive.org item identifier (the slug in archive.org/details/<identifier>). From a search result's 'identifier'.
query (required) | - | The phrase to find inside the book's text.
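Both parameters are required here, so a client may want to reject empty values before spending a credit. The sketch below is one way to do that; the helper name is hypothetical, and trimming whitespace is a local choice rather than documented behaviour.

```python
def build_search_inside_payload(identifier, query):
    """Validate the two required fields and return the request body."""
    identifier = identifier.strip()
    query = query.strip()
    if not identifier:
        raise ValueError("identifier is required")
    if not query:
        raise ValueError("query is required")
    return {"identifier": identifier, "query": query}


# "example_book" is a made-up identifier used only for illustration.
print(build_search_inside_payload("example_book", "  north pole "))
```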