Wikidata:WikiProject Names
Purpose
This WikiProject aims to improve the structure of name-related data in Wikidata. Priority is given to first names.
Participants
The participants listed below can be notified using the following template in discussions:
{{Ping project|Names}}
Help
See http://ashtree.eu/wordpress/prenom-wikidata/ (in French)
- male given name: Sylvain (Q16281827)
- family name: Sylvain (Q18001608)
- name: Sylvain (Q18086660)
- Wikimedia disambiguation page: Sylvain (Q15849776)
- variants of male given name: Silvano (Q677210), Silvan (Q17565057), Silvanus (Q17565188), Szilvánusz (Q17565212), Sylwan (Q17565214)
- variants of surname: Silvanus (Q16281517)
- female version of male given name: Sylvaine (Q17617112)
How to create a new item for a given name
- start with Special:NewItem
- sample (your language): Special:NewItem?description=male given name
- sample (en): Special:NewItem?description=male given name
- sample (de): Special:NewItem?description=männlicher Vorname
- sample (fr): Special:NewItem?description=prénom masculin
- add instance of (P31) with the appropriate type
- use nameGuzzler to set the label for all languages with Roman script
- use autoEdit to add a description ("male given name"). Resolve any conflicts that arise with existing label/description combinations
- if possible, add said to be the same as (P460) to link to at least one other given name
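The pre-filled creation links in the samples above can be generated rather than typed by hand. The following is a minimal sketch: the `description` parameter and the three descriptions come from the samples, while the `www.wikidata.org` base URL and the helper name `new_item_url` are assumptions made for illustration.

```python
from urllib.parse import urlencode

# Assumption: the usual www.wikidata.org host. Only the "description"
# parameter is taken from the samples in the list above.
BASE_URL = "https://www.wikidata.org/wiki/Special:NewItem"

# Descriptions copied from the en/de/fr samples above.
DESCRIPTIONS = {
    "en": "male given name",
    "de": "männlicher Vorname",
    "fr": "prénom masculin",
}


def new_item_url(language: str) -> str:
    """Return a Special:NewItem link with the description pre-filled."""
    query = urlencode({"description": DESCRIPTIONS[language]})
    return f"{BASE_URL}?{query}"


for code in DESCRIPTIONS:
    print(code, new_item_url(code))
```

`urlencode` percent-encodes non-ASCII characters and writes spaces as `+`, so the German and French descriptions survive being pasted into a browser or a wiki link.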
How to clean up a given name item
Items with P31 = Wikimedia disambiguation page (Q4167410)
Sometimes given name items include information about a similar family name or links to disambiguation pages. What to do with such an item depends on its content, the items linked to it, and other items that are already available. Options are:
- keep it as a disambiguation
- keep it as a given name (first name) item
- keep it as a name item
- keep it as a family name (surname) item
Items should not be re-purposed: the item should be kept as a disambiguation and new items created for given names and family names.
- Uses of the item should be moved to (new) given name/family name items.
- If the disambiguation item remains empty, without any links, it will eventually be deleted.
Given name items
For items that are for given names (first names), the following clean-up steps may be needed:
- clean up instance of (P31)
- add native label (P1705)
- clean up labels (for items with native label (P1705) in Roman script, all labels in Roman script should be identical)
- clean up descriptions (remove descriptions related to family names or disambiguation)
- clean up aliases (languages in Roman script generally would not have any aliases for spelling variations).
- clean up interwiki links:
- if an article listed in the interwikis spells the name differently than the label of the item: create a new item for this variant of the name and move the interwiki there
- if interwikis link to disambiguation pages: these should be moved to another item
- clean up uses of the item:
- if the name of the person is spelled differently in the item of the person using it, replace it in given name (P735) with the appropriate item
- if the item is used with family name (P734), replace it with the appropriate item for the family name
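The label rule in the list above (all Roman-script labels identical to the native label) lends itself to a mechanical check. This is a minimal sketch over plain dictionaries, not a call to any Wikidata API: the function name, the sample labels, and the set of Roman-script language codes are all hypothetical.

```python
def mismatched_labels(labels, native_label, roman_script_langs):
    """Return {language: label} for Roman-script labels that differ
    from the native label (P1705) of the item."""
    return {
        lang: label
        for lang, label in labels.items()
        if lang in roman_script_langs and label != native_label
    }


# Hypothetical item data, for illustration only.
labels = {"en": "Sylvain", "fr": "Sylvain", "it": "Silvano", "ru": "Сильвен"}
roman_script_langs = {"en", "fr", "it", "de"}

print(mismatched_labels(labels, "Sylvain", roman_script_langs))
```

Here only the Italian label is reported: the Russian label is outside the Roman-script set, and a language with no label at all (German in this sample) is not flagged, since missing labels are a separate clean-up step.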
How to clean up given name items (top-down approach)
Sample approach (by Jura):
- Check for items with clean instance of (P31) classification:
- list
- Other items may include "family name" or "disambiguation" in instance of (P31). These are generally not used, but might need cleanup
- Check labels:
- labels in Roman script:
- labels are ideally identical for one item (alternate list, items needing cleanup)
- and unique (few exceptions: Jean/Jean)
- usage: frequently used: >100, rarely used: <10
- labels missing one language (here language=nb)
- Check descriptions
- Missing given names
- Names
- Selects items that are likely about persons, but lack P735
- Pulls out the first part of the label (likely a given name)
- Filters out (some) Asian names or titles such as "Princess"
- Sitelinks differing from labels: these can be spun off to new items. (Sample: it:"Roberto" if linked on item with label "Robert")
- Wikipedia lists (click "show")
- Names
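The "missing given names" step above (take the first part of a person's label and skip titles) can be sketched as follows. The title list is an incomplete, illustrative assumption, and the function name is hypothetical. The filter for Asian names mentioned above is not attempted here, because it cannot be reduced to a simple word list.

```python
from typing import Optional

# Assumption: an illustrative, incomplete set of leading titles to skip.
TITLES = frozenset({"Sir", "Lord", "Lady", "Princess"})


def candidate_given_name(label: str) -> Optional[str]:
    """Return the first word of a person's label as a likely given name,
    or None when the label is empty or starts with a known title."""
    words = label.split()
    if not words or words[0] in TITLES:
        return None
    return words[0]


print(candidate_given_name("George Washington"))
```

The result is only a candidate: it still has to be matched against an existing given name item before it is added to given name (P735).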
Main elements
Main items
- personal name (Q1071027)
- given name (Q202444)
- family name (Q101352)
- after-name (Q4116295)
- legal name (Q666791)
- nickname (Q49614)
- religious name (Q1417657)
Properties
| Title | ID | Data type | Description | Examples | Inverse |
|---|---|---|---|---|---|
| given name | P735 | Item | given name: first name or another given name of this person; values used with the property shouldn't link disambiguations nor family names | George Washington <given name> George | - |
| family name | P734 | Item | family name: surname or last name of a person | George Washington <family name> Washington | - |
| birth name | P1477 | Monolingual text | name at birth: full name of a person at birth, if different from their current, generally used name (samples: John Peter Doe for Joe Doe, Ann Smith for Ann Miller) | Mark Twain <birth name> Samuel Langhorne Clemens (language: en) | - |
| noble family | P53 | Item | noble family: include dynasty and nobility houses | Genghis Khan <noble family> Borjigin | - |
| pseudonym | P742 | String | pseudonym: alias used by someone or by which this person is universally known | Mark Twain <pseudonym> Mark Twain | - |
| noble title | P97 | Item | royal or noble rank: titles held by the person | William Mansfield, 1st Baron Sandhurst <noble title> Baron Sandhurst | - |
| honorific prefix | P511 | Item | honorific and title of honor: word or expression used before a name, in addressing or referring to a person | Douglas Haig, 1st Earl Haig <honorific prefix> The Right Honourable | - |
| family name identical to this given name | P1533 | Item | last name that is the same as a given first name. Use on items for given names | Sylvain <family name identical to this given name> Sylvain | - |
| name in native language | P1559 | Monolingual text | name: name of a person in their native language | Barack Obama <name in native language> Barack Hussein Obama II (language: en) | - |
| given name version for other gender | P1560 | Item | equivalent name (with respect to the meaning of the name) in the same language: female version of a male first name, male version of a female first name. Add primarily the closest matching one | Riccardo <given name version for other gender> Riccarda | - |
| name day | P1750 | Item | name day: day of the year associated with a first/given name. A qualifier should be used to identify the calendar that is being used. Distinguish from "feast day" (P841) | Lucy <name day> December 13 | - |
| language of work or name | P407 | Item | for works (for original language use P364 and for persons P103 and P1412) | Charles <language of work or name> English | - |
| writing system | P282 | Item | writing system: alphabet, character set or other system of writing used by subject language | William <writing system> Latin script | - |
| second surname in Spanish name | P1950 | Item | second or maternal family name in Spanish names (do not use for other double barrelled names) | Gabriel García Márquez <second surname in Spanish name> Márquez | - |
| significant event | P793 | Item | key event: significant or notable events associated with the subject. Statistics about a first name: use qualifiers to indicate rank with series ordinal (P1545) and/or occurrences with quantity (P1114) | Cornelis <significant event> most frequent first names at birth in Rotterdam (1811-1913) | - |
| significant event | P793 | Item | key event: significant or notable events associated with the subject. Status in Iceland: use given name authorized in Iceland (Q26959205) as value | Arnar <significant event> given name authorized in Iceland | - |
Uses
For a given name
| Title | ID | Data type | Description | Examples | Inverse |
|---|---|---|---|---|---|
| instance of | P31 | Item | instance of: that class of which this subject is a particular example and member. (Subject typically an individual member with Proper Name label.) Different from P279 (subclass of). | John <instance of> male given name | - |
| language of work or name | P407 | Item | for works (for original language use P364 and for persons P103 and P1412) | John <language of work or name> English | - |
| said to be the same as | P460 | Item | Wikimedia duplicated page and synonym: this item is said to be the same as that item, but the statement is disputed | John <said to be the same as> Jean | P460 |
| part of | P361 | Item | part: object of which the subject is a part. Inverse property of "has part" (P527). | Sylvain <part of> Sylvain | has part |
| family name identical to this given name | P1533 | Item | last name that is the same as a given first name. Use on items for given names | Sylvain <family name identical to this given name> Sylvain | - |
| given name version for other gender | P1560 | Item | equivalent name (with respect to the meaning of the name) in the same language: female version of a male first name, male version of a female first name. Add primarily the closest matching one | Riccardo <given name version for other gender> Riccarda | P1560 |
| opposite of | P461 | Item | opposite and antonym: item that is the opposite of this item | Jean <opposite of> Jean | P461 |
- Create a new item when none is available for a name.
- Labels for given names should be the same in all languages with Roman script.
- Alias: languages in Roman script generally would not have any aliases.
- People with translated given names could have several items for the same given name.
- People with several given names will have several values in given name (P735).
- Avoid adding items with Wikimedia disambiguation page (Q4167410) or items that link to disambiguation pages in one of the languages.
- The property should only be used on items for persons (humans or fictional humans).
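The rule above that given name (P735) should not point at disambiguation or family name items can be checked once the P31 values of each candidate are known. The sketch below works on plain dictionaries and is only an illustration: the function name is hypothetical, and the `instance_of` mapping is assumed from the item types shown in the Sylvain example in the Help section rather than fetched from Wikidata.

```python
DISAMBIGUATION = "Q4167410"  # Wikimedia disambiguation page
FAMILY_NAME = "Q101352"      # family name
BLOCKED_TYPES = {DISAMBIGUATION, FAMILY_NAME}


def bad_given_name_values(given_name_values, instance_of):
    """Return the P735 values whose P31 includes a blocked type.

    given_name_values: list of item IDs used as given name (P735).
    instance_of: mapping of item ID -> list of its P31 values.
    """
    return [
        qid
        for qid in given_name_values
        if BLOCKED_TYPES & set(instance_of.get(qid, ()))
    ]


# Assumed P31 values, following the Sylvain example in the Help section.
instance_of = {
    "Q16281827": ["Q12308941"],  # Sylvain, male given name
    "Q18001608": ["Q101352"],    # Sylvain, family name
    "Q15849776": ["Q4167410"],   # Sylvain, disambiguation page
}

print(bad_given_name_values(["Q16281827", "Q15849776", "Q18001608"], instance_of))
```

Items with no known P31 are let through by this sketch; the report "People whose value for given name has no instance of (P31)" covers that case separately.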
For a family name/surname
| Title | ID | Data type | Description | Examples | Inverse |
|---|---|---|---|---|---|
| instance of | P31 | Item | instance of: that class of which this subject is a particular example and member. (Subject typically an individual member with Proper Name label.) Different from P279 (subclass of). | Fisher <instance of> family name | - |
| named after | P138 | Item | eponym: entity or event that inspired the subject's name, or namesake (in at least one language) | Fisher <named after> fisherman | - |
| part of | P361 | Item | part: object of which the subject is a part. Inverse property of "has part" (P527). | Sylvain <part of> Sylvain | has part |
- Labels for family names should be the same in all languages with Roman script.
- Data in the property "family name" is not meant to indicate any sort of kinship between people with the same surname.
- Avoid adding items with Wikimedia disambiguation page (Q4167410) or items that link to disambiguation pages in one of the languages.
- The property should only be used on items for persons (humans or fictional humans).
For a combined family name with given name
The approach is the following:
- Use instance of (P31) = name (Q82799) on the combined "name" item. Example: Alonso (Q2650702)
- On the specific items for the given name and for the family name, link this with "part of". Example: Alonso (Q18552177) for given name, Alonso (Q18552178) for family name
- No given names/family name properties should use the name item. Alonso (Q2650702) in the example.
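The three Alonso items above can be written out as a small data structure to make the relationships explicit. This is a sketch of the model only, using plain dictionaries rather than any Wikidata client library; the helper name `usable_as_name_value` is hypothetical.

```python
NAME = "Q82799"  # name

# Statements as described in the list above (property -> list of values).
statements = {
    "Q2650702": {"P31": [NAME]},          # Alonso, combined "name" item
    "Q18552177": {"P361": ["Q2650702"]},  # Alonso, given name, part of the name item
    "Q18552178": {"P361": ["Q2650702"]},  # Alonso, family name, part of the name item
}


def usable_as_name_value(qid, statements):
    """Return False for combined "name" items, which should not be used
    as values of given name (P735) or family name (P734)."""
    return NAME not in statements.get(qid, {}).get("P31", [])


for qid in statements:
    print(qid, usable_as_name_value(qid, statements))
```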
Specific naming conventions
Roman names
Roman names generally have the following parts: praenomen, nomen, and cognomen. Sometimes, an agnomen is used as well. → en:Roman naming conventions#The tria nomina.
- Properties to use for these are: Roman praenomen (P2358), Roman nomen gentilicium (P2359), Roman cognomen (P2365), Roman agnomen (P2366)
- Nomen should link to the item for the w:Category:Roman gentes -> Category:Roman gentes (Q9815532)
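As a rough illustration of how the parts map onto the four properties, the sketch below splits a name positionally. This is a simplification under stated assumptions: it only handles names of exactly three or four words, real names can carry several cognomina, and actual statements would point to items rather than strings. The function name is hypothetical.

```python
# Properties in positional order: praenomen, nomen, cognomen, agnomen.
ROMAN_NAME_PROPERTIES = ["P2358", "P2359", "P2365", "P2366"]


def roman_name_parts(full_name):
    """Map the words of a three- or four-part Roman name to properties.

    Naive positional split; raises ValueError for any other length.
    """
    parts = full_name.split()
    if not 3 <= len(parts) <= 4:
        raise ValueError(f"expected 3 or 4 name parts, got {len(parts)}")
    return dict(zip(ROMAN_NAME_PROPERTIES, parts))


print(roman_name_parts("Gaius Julius Caesar"))
print(roman_name_parts("Publius Cornelius Scipio Africanus"))
```

Anything the positional rule cannot handle raises an error instead of guessing, so such names are left for manual review.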
Points being developed
For given names:
- An item for each first name
- create an item for every variation of a given name.
- Labels
- use identical labels across languages with Roman script. Tools like the new LabelLister or the list at Person names help check them, and nameGuzzler can set them.
- Descriptions
- use standardized descriptions such as "male given name", "female given name". Tools like the new LabelLister can help check them and remove old ones; autoEdit can add standardized ones. This helps identify and resolve inconsistent or duplicate items. Normalize descriptions with terms used on given name (Q202444), male given name (Q12308941), female given name (Q11879590)
- De-mix disambiguation items
- avoid having items in given name (P735) with "disambiguation" in instance of (P31). Create new items for these given names instead.
- Distinguish between items for given names and items for family names
- separate items should be available for each. family name identical to this given name (P1533) can link them.
- List related given names
- similar first names from other languages can be listed with said to be the same as (P460).
- Complete missing names in the main languages
- Add_name_labels works fine for this
- Distribution maps
- maps from Commons added
- Reasonator
- improved handling of "see also" on given names (displaying languages)
- Lady/Sir/Lord/…
- given names for items with labels starting with "Sir", "Lord", "Lady", etc.
- Japanese names
- Romanized Japanese names can mix several names. Example: "Yuriko" for ゆりこ or ゆり子 or 百合子. Each of these three should have an item. An additional undifferentiated item can be used if it can't be determined which one applies and it can also link a possible article about all three at enwiki.
- Pronunciation
- link files available at Commons
- Japanese names
- description of the approach currently used
Statistics
[Charts for 2014 and 2015: items with given names and items without given names]
Tasks and task forces
- Wikidata:WikiProject Names/first names (1): Done. Aims to add given name property to an initial set of 330 male given names.
- Wikidata:WikiProject Names/first names (2): Done. Aims to add given name property to an initial set of 200 female given names.
- Wikidata:WikiProject Names/first names (3): Done. Aims to add given name property to 20 most common given names in the US.
- Wikidata:WikiProject Names/first names (4): Done. Aims to add given name property to 20 most common given names in France.
- Wikidata:WikiProject Names/first names (5)
- Wikidata:WikiProject Names/first names (6): Done. Top 60 constraint violations task.
- Wikidata:WikiProject Names/first names (7): Done. Fix or remove links to non-given name items with given name (P735).
- Wikidata:WikiProject Names/first names (8)
- Wikidata:WikiProject Names/first names (9)
- Wikidata:WikiProject Names/first names (10): given name labels
Regularly updated reports and dynamic lists
- Wikidata:Database reports/Constraint violations/P735 (given names)
- Wikidata:Database reports/Constraint violations/P734 (surnames)
- Person names (labels on items used by P735/P734: mixed and identical)
- Given names mixed with disambiguation pages: Autolist: given names and disambiguations
- first 15 persons lacking the given name property (click "run" on page)
- Females with given name: male given name
- Males with given name: female given name
- Items with labels starting with "Sir", but no given name (P735)
- compound given names without has part (P527)
- Most linked disambiguation page items
- People whose value for given name has no instance of (P31)
Points to develop
For given names:
- How to cover combined given names?
- Given names can be combined with a dash (sample: "Jean-Paul") or without a dash (sample: "John Paul"). For names like "Jean-Paul", there would probably be a single item in given name (P735). For "John Paul", there might be three: "John", "John Paul" and "Paul". Various names should probably be checked to match one or the other way.
- How to ensure given names are correctly set for languages that add them after the family name (e.g. Chinese, Korean, Japanese, Hungarian)?
- We will need to make sure that we don't place family names as given names for these. Sample: "Kim Jong-il" (Kim Jong-il (Q10665)) has the family name "Kim" and not a given name "Kim". Hungarian names are being worked on.
- "Master of altar of the church"
- review and possibly localize names including "Master of…" etc.
- Senior/Junior
- Localize (or not) when included in name.
- Module/infobox
- build a sample infobox. Possible solution: add given name (P735) with "no value" and qualifier instance of (P31): master of … (Q19968968)
- Integrated Icelandic last names
- tbd
- Roman cognomen
- determine how to handle them
- Indian names
- check names in the format <family name> - <given name>
- instance of (P31)
- define usage for items like mononymous person (Q2985549)
- Inclusion of etymology
- this should be included in a structured way. It is currently left to Wikipedia infobox and/or Wiktionary (July 2015 property proposal, May 2015 Wiktionary absorption proposal). Please don't add etymologies to descriptions instead.
- Inclusion of the new transliteration properties
- a series of transliteration properties recently became available; these should be included in the model.
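For the combined given names question raised at the top of this list, the tentative split ("Jean-Paul" as a single item, "John Paul" as possibly three) can be sketched as a small function. This only restates the open proposal; it is not a settled rule, and the function name is hypothetical.

```python
def given_name_candidates(given_names):
    """List candidate given name labels for a person's given names.

    A hyphenated or single name stays whole. A space-separated
    combination yields the first part, the full combination, and
    the remaining parts.
    """
    parts = given_names.split()
    if len(parts) <= 1:
        return parts
    return [parts[0], " ".join(parts), *parts[1:]]


print(given_name_candidates("Jean-Paul"))
print(given_name_candidates("John Paul"))
```

Whether a given person should carry one value or three in given name (P735) would still need to be checked case by case, as noted above.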
Tools
- The gadget "autoEdit" ("Automatic addition" on the left toolbar) can add a description for family names to a series of languages. (For first names, try this).
- nameGuzzler ("VIP label" on the left toolbar) can set the label in a series of languages.
- LabelLister lets you view and edit labels, descriptions and aliases in several languages, one by one
- LabelLister BETA version can edit labels, descriptions and aliases in all languages at once.
- Autolist2 can search for given names in labels and add corresponding labels (doesn't work for family names and the "permanent link" feature on the interface doesn't work either)
- Sample search for "Lara": [1] (click "run" on the page).
- Selection of clean "given name" items: Autolist: given names
- Selection of clean "family name" items: Autolist: family names
- Given names by language: Catalan, Italian
- Lists of given names (click "Show")
- Various queries on "Quarry"
- Most frequent given names (query by Jheald)
- Template:Interwikis from P460 (Q21529474) adds interwikis from items included in said to be the same as (P460) to a Wikipedia article.