Data sources crediting

This commit is contained in:
veniasilente 2023-06-30 23:48:04 -04:00
parent 359f5ed9bd
commit 071c928c8a
2 changed files with 28 additions and 2 deletions

View file

@ -21,7 +21,12 @@ Dictionary files follow a naming schema of the form ``{content}.{lang}.dic``, wh
* Dictionaries for Spanish ("es") and Italian ("it") official Pokémon names are **currently pending**, as it was not until G9's release of Paradox forms that the "it" and "es" internationalization teams started diverging from the "en" namelist.
* Files with the prefix ``pkunreleased`` contain names for unreleased entities, such as the leaked "beta Pokémon". These are credited in an associated CSV file where possible.
* Files with the prefix ``pkunreleased`` contain names for unreleased entities, such as the leaked "beta Pokémon".
* Where possible, data sources are credited in a folder-level ``SOURCES.md`` or in a per-source associated CSV file.
## License
These contents are licensed under **Creative Commons** NC-SA-4.0. For more details check ``LICENSE``.

21
original/SOURCES.md Normal file
View file

@ -0,0 +1,21 @@
## Data Sources
For original / canon information, the following sources are used:
* Files ``pokedex_pokemon.${lang}.dic`` are fed from Bulbapedia's page of "List of Pokémon in other languages" and from a veekun schema-compatible database listing.
* Files ``moves.${lang}.dic`` are fed from Bulbapedia's page of "List of moves in other languages" and from a veekun schema-compatible database listing.
* Other files are fed from Bulbapedia's tabular data pages on internationalization, such as ["List of locations in other languages"](https://bulbapedia.bulbagarden.net/wiki/List_of_locations_in_other_languages)
The following SQL query, applied to a Veekun "pokedex" schema, extracts in its second column the names for Pokémon in a given ISO language code ("``{lang}``"):
```
select P.id, PN.name
FROM
pokemon P
left join pokemon_species_names PN
on PN.pokemon_species_id = P.id
join languages L on PN.local_language_id=L.id
and L.identifier={lang}
where P.id is not null
order by P.id, L.id
```