Anna’s Archive
🌐 en - English - English
am - አማርኛ - Amharic
ar - العربية - Arabic
ast - asturianu - Asturian
az - azərbaycan - Azerbaijani
be - беларуская - Belarusian
bg - български - Bulgarian
bn - বাংলা - Bangla
br - Brasil: português - Portuguese (Brazil)
ca - català - Catalan
ckb - کوردیی ناوەندی - Central Kurdish
cs - čeština - Czech
da - dansk - Danish
de - Deutsch - German
el - Ελληνικά - Greek
en - English - English ☑️
eo - Esperanto - Esperanto
es - español - Spanish
et - eesti - Estonian
fa - فارسی - Persian
fi - suomi - Finnish
fil - Filipino - Filipino
fr - français - French
gl - galego - Galician
gu - ગુજરાતી - Gujarati
ha - Hausa - Hausa
he - עברית - Hebrew
hi - हिन्दी - Hindi
hr - hrvatski - Croatian
hu - magyar - Hungarian
hy - հայերեն - Armenian
id - Indonesia - Indonesian
it - italiano - Italian
ja - 日本語 - Japanese
jv - Jawa - Javanese
ka - ქართული - Georgian
ko - 한국어 - Korean
lt - lietuvių - Lithuanian
ml - മലയാളം - Malayalam
mr - मराठी - Marathi
ms - Melayu - Malay
ne - नेपाली - Nepali
nl - Nederlands - Dutch
no - norsk bokmål - Norwegian Bokmål (Norway)
or - ଓଡ଼ିଆ - Odia
pl - polski - Polish
ps - پښتو - Pashto
pt - Portugal: português - Portuguese (Portugal)
ro - română - Romanian
ru - русский - Russian
sk - slovenčina - Slovak
sl - slovenščina - Slovenian
sq - shqip - Albanian
sr - српски - Serbian
sv - svenska - Swedish
ta - தமிழ் - Tamil
te - తెలుగు - Telugu
th - ไทย - Thai
tr - Türkçe - Turkish
tw - 中文 (繁體) - Chinese (Traditional)
uk - українська - Ukrainian
ur - اردو - Urdu
vec - veneto - Venetian
vi - Tiếng Việt - Vietnamese
yue - 粵語 - Cantonese
zh - 中文 - Chinese
Account
If you are interested in mirroring this dataset for
archival or
LLM training purposes, please contact us.
Overview from
datasets page .
Source
Metadata
Last updated
Google Books [gbooks]
❌ Not available directly in bulk, protected against scraping.
❌ Most files are closely guarded. We will award a
$200k bounty if you can get the full collection.
2024-09-20
Volunteer “j” has managed a large scrape of Google Books metadata.
Metadata is good to have, but the real goal is to get their actual scans. In 2019 Google claimed to have scanned 40 million books. Since the AI race heated up in late 2022, it is to be expected that Google has increased their rate of scanning. We will award a $200k bounty if you can get the full collection.
Resources