Commons:ファイル名の付け方
ウィキメディア・コモンズにアップロードするファイルには適切な名前をつける必要があります。この記事では、以下のようないくつか考慮すべき点を挙げています
Purpose of file names
Names are used to uniquely identify the item involved. Names should be
- 説明的でなくてはいけません。その画像が何を表示したものか、あるいはコンテンツが何を描写するものか、に応じて選ぶべきです。
- 正確であるべきです。科学的な名前、固有名詞、年月日、などを用いるときには特に。
Contributors categorizing files frequently have different demands from those who create, process, manage and upload them. Unless there is a compelling reason not to, the uploader's choice of name should be honored. This is a courtesy, not an absolute right, however. If a name is disruptive or inappropriate, a different name may be chosen.[1]
Naming conventions
Media files can be uploaded with names in any language in any script (coded as UTF-8) - see Commons:Language policy. The filename extension (e.g., .jpg) should match the file format (e.g., JPEG), and not be doubled (e.g., .jpg.jpg or .tiff.jpg). The filename should clearly describe what the file contains, but it should be reasonably concise. Whether a filename is perceived to be suitable often depends on the familiarity of the individual with the subject. It also occurs that items are known under different terms to different contributors or they believe entities are primarily well-known under the term they are used with to describe this entity, neglecting cultural differences, even within countries. Therefore, it is likely that no single name will completely satisfy all contributors. The following lists various characteristics of a good file name. These should be seen as goals, not as rules written in stone. Generally speaking, stable filenames are more important than perfect filenames, and missing a few of these goals is not justification for moving the file. But when in doubt, aim for a stable more generic name.[2]
Descriptive
- Meaningful – Names should not consist entirely of auto-generated letters and numbers, such as "DSC123456.jpg". Commons uses the Filename prefix blacklist and the Titleblacklist to enforce this policy.
- Content-based – Prioritize describing the subject matter over indicating its origin. Do not use names that solely consist of dates, the name of the photographer or rights holder, terms like “Flickr”, “original”, “crop”, and/or catalogue numbers. For example, File:20110428 OH K1023900 0014.JPG - Flickr - NZ Defence Force.jpg does not describe its subject matter at all despite its length. However, it is acceptable to include such information within the name so long as the name remains reasonably brief and also contains a description of the subject matter. Remember that details like photographer name and source information can alternatively be included in the file's metadata or description. Situations where the subject is identified with a date, such as the book 1984, are exceptions to this criterion.
- Specific – For images of places, the name should describe a specific location with phrasing that aids in figuring out where the image was taken if it is known by the uploader and what the image depicts, such as Colcester Zoo, 18 Rue Norvins, or Anime Expo 2022. The name should not consist primarily of a broad location, such as File:Paris 319.jpg, Ontario hill, or Japan train station, where the location is so large that only someone who knows the area very well can identify the image. Similarly, unless the file is an icon, clip art, or other illustration where a broad category is most descriptive of the subject, the name should not consist primarily of a generic or broad category such as a word like "smartphone", "screenshot", "queen" or "bird", but rather impart detailed information that would help someone identify the specific object depicted, such as "Nokia N8 Blue (Front)".
- Precise – The name should unambiguously identify the file's subject, and distinguish it from other similar subjects. For example, File:Michaeljackson.jpg should have included some information to distinguish itself from files in Category:Michael Jackson. For distinguishing places, refer to our sister project Wikivoyage:Naming conventions § Disambiguation.
- Correct – The name should describe the file's content and convey what the subject is actually called. Inaccurate names for the file subject, although they may be common, should be avoided. The title given to a work of art by the artist that created it is considered appropriate, even if the name has nothing to do with what is depicted (for example, many works of Dadaism). The name should also be free of obvious errors, such as misspelled proper nouns, incorrect dates, and misidentified objects or organisms. Users are allowed to upload "unidentified" or "unknown" organisms but such files may be renamed upon identification.
- Time – Include the year or date a photo was taken
Clear
- ファイル名は240バイトの長さまでです[3]。ファイル名が240バイトを超えるファイルに上書きアップロードしようとすると、ファイルがひどく壊れてしまう可能性があります(15バイトの年月日時が prefix としてファイル名に追加されるため)。英語のファイル名は通常、1文字につき1バイトです(一部の記号は ASCII文字セット外です)が、他の言語や、非ASCII文字では、1文字につき最大4バイトを要する場合もあり、240バイトで表示される文字数は240文字よりずっと少なくなることがあります。
- Spelled out – Abbreviations, acronyms, and a person's initials are often ambiguous and thus should be spelled out. Although such initialisms are related to the subject of the file, the meaning is not immediately clear to the reader. Spelling out a subject's full name also aids significantly in searching. For purposes of concision, it is allowed to use well-known acronyms and initialisms such as NATO, so long as other parts of the name provide sufficient information to identify the subject, or to use abbreviations for the image source.
- Recognizable – As many as possible should be able to understand the name, whether they are an expert, someone familiar with the subject area, or someone on the street.
- Intuitive – Names should anticipate what users are likely to type when looking or searching for the subject. Significant keywords not present in the name should be included in the description or metadata.
Practical considerations
- Unique – By the design of Commons's software, no two files can have the same name. To prevent collisions, it is encouraged to add strongly distinguishing information such as a source, date, or catalogue identifier, although it is not always necessary.
- Appropriate – Names should be neither vulgar (unless unavoidable) nor pedantic. Names apparently created for the purpose of vandalism, attack, or provocation, such as libelous, insulting, degrading, crude, or offensive descriptions, names containing inappropriate or non-public personal information, or names that are blatant advertising or self-promotion, will be removed immediately. For example, an image of a person with the name "File:1BIGGest_nOSE_everS33n.JPG" is unlikely to remain. Names that are or have been associated with nationalistic, religious or racist causes are allowed provided they are legal to host and otherwise fall within Commons scope, for example a filename like "File:Taiwanese Tiaoyutai islands map.png".
- Neutral – Refer to Commons:Project scope/Neutral point of view
- There is no obligation to use a specific language for new uploads, even if other files in a subject area use that language. Any relevant spelling variation may be used.
Tie-breaking criteria
- Common – When considering different names for a subject, names that are more commonly used (as determined by prevalence in reliable sources) are preferred. Search engines, international organizations, media outlets, encyclopedias, databases, scientific bodies, and scientific journals may be consulted to identify the most common name(s). For places, the name should be the most commonly-used name in the local language.
- Consistent – The name should be consistent with the pattern of similar files' names. Many naming conventions exist, and sometimes conflict with other criteria; there is currently no standard set of conventions. The costs and benefits should be weighed when considering any specific convention.
- For batch uploads, use a consistent filename template based on what data is available. Sample templates include
{title} ({source})
,{title} - {source} {id}
, and{brief_description}, {year}
. - Files that form parts of a whole (such as scans from the same book or large images that are divided into smaller portions due to Commons’ upload size restriction) should follow the same naming convention so that they appear together, in order, in categories and lists.
- Certain complex templates (such as those that use BSicons or that display football kits) assume that the images used in them will follow a specific naming convention. Wikisource also uses a specific naming convention for the source files they transcribe.
- For batch uploads, use a consistent filename template based on what data is available. Sample templates include
Language-specific guidelines
These guidelines apply to names in English. Speakers of other languages may define guidelines for their language in the relevant translations.
- The preferred name style is sentence case (downstyle) with initial capitalization and without ending punctuation, e.g. "Smoky sunset in Taiwan", as it is more readable and contains the most information. Other conventions such as title case may make sense in the context of batch uploads, as it is difficult to infer sentence case from all caps.
- Articles (a, an, the) can often be removed without changing the meaning.
- Names are not full sentences, but small bits of information. In most cases, the proper length is between two and twelve words. One-word names are almost always too ambiguous, and should be avoided. If the name is 20 words it is probably too long, and if it is 30 or more, it is almost definitely too long. English filenames will usually use 1 byte per character (some symbols may fall outside the ASCII character set), allowing 240 characters or approximately 50 words as the hard maximum. For non-ASCII characters, 240 bytes may be much less than 240 characters, as these can take up to 4 bytes per character.[4]
- Names should not go out-of-date. Avoid words and phrases like "current", "incumbent", "expected", "recently", "soon", or "next year", preferring more precise language such as "in 1969" or "fifth president".
- Avoid abusing Unicode. Control characters can be omitted, strange punctuation can be replaced with standard quotes and commas, and symbols such as "♥" are often more natural when spelled out ("heart"), also increasing visibility in search. Furthermore some characters do not render correctly at all in certain operating systems and browsers. It is a good idea to stick to letters, numbers, underscore (space), ASCII hyphen/minus/dash, plus, and period (dot), as these do not have any MediaWiki restrictions. Letters with diacritics and accents are acceptable, but so is omitting diacritics and accents (e.g. "Calderón"/"Calderon", "Erdoğan"/"Erdogan").
脚注
- ↑ Commons:File renaming
- ↑ Commons:Requests for comment/File renaming criterion 2, Commons:Blocking policy, Commons:Project scope#Examples, Commons:Revision deletion
- ↑ 2011年後半までは255バイトが上限でした (Phabricator: T32202) ので、既存のファイルには255バイトのものがあるかもしれませんが、新規にアップロードする場合は240バイト以下に制限されています。
- ↑ Phabricator: T32202