Changes to GEOLAND

The following discussion is an archived record of a request for comment. Please do not modify it. No further edits should be made to this discussion. A summary of the conclusions reached follows.
Withdrawing - option 2 stands no chance of passing and the discussion has clearly run its course. Happy to reinstate if anyone wants but no point dragging this out. FOARP (talk) 14:44, 8 October 2023 (UTC)

Regarding the section "Settlements and administrative regions" of the notability guide for geographical features (WP:NGEO), which of the following should we do?:

Settlements and administrative regions are typically presumed notable, provided information about them beyond statistics, region, and coordinates is known to exist. The extent of coverage in secondary sources should be considered to ensure there is enough verifiable content for an encyclopedic article. Editors should avoid creating very short articles on these topics, e.g., "Village X is in County Y, Province Z of Country A, at coordinates C, with a population of P." is not enough content for a stand-alone article.
If a Wikipedia article on a settlement or administrative region cannot be developed using known sources, information on it should instead be included in a more general article on a higher-level administrative region, or in a list article collecting information on similar settlements or regions (per the list notability criteria and common list selection criteria #2).
This criterion applies to settlements and administrative regions including states, provinces, municipalities, districts, parishes, counties, cities, villages, towns, and their equivalents, that provide general-purpose administration for their region, regardless of whether they presently exist or are abandoned, even if their population is low. It does not apply to special-purpose districts such as census tracts, irrigation districts, library districts, etc. which should instead be assessed on a case-by-case basis according to the general notability guideline.
This change to be made in combination with the amonishment that editors should avoid flooding AFD with articles (e.g., more than 10 nominations per day as with minor planets) that fail the amended guide. FOARP (talk) 16:22, 15 September 2023 (UTC)
The current "Settlements and administrative regions" guideline, which would be removed in favor of the text above
  • Populated, legally recognized places are typically presumed to be notable, even if their population is very low. Even abandoned places can be notable, because notability encompasses their entire history. Census tracts, Abadi, and other areas not commonly recognized as a place (such as the area in an irrigation district) are not presumed to be notable. The Geographic Names Information System and the GEOnet Names Server do not satisfy the "legal recognition" requirement and are also unreliable for "populated place" designation.[1][2]
  • Populated places without legal recognition are considered on a case-by-case basis in accordance with the GNG. Examples may include subdivisions, business parks, housing developments, informal regions of a state, unofficial neighborhoods, etc. – any of which could be considered notable on a case-by-case basis, given non-trivial coverage by their name in multiple, independent reliable sources. If a Wikipedia article cannot be developed using known sources, information on the informal place should be included in the more general article on the legally recognized populated place or administrative subdivision that contains it.
  • Disputed regions are generally considered case-by-case. Their notability for Wikipedia is independent of the validity of their claims. Sometimes it may be more appropriate to merge these articles into ones on a broader conflict or political movement, or to merge articles on multiple disputed names for the same region into one article.

References

  1. ^ "Wikipedia:Reliable sources/Noticeboard". See also WP:GNIS.
  2. ^ "Wikipedia:Reliable sources/Noticeboard".

Background

The proposed amendment is the result of a discussion on the NGEO talk-page involving the editors @Davidstewartharvey, Bookku, Certes, North8000, Horse Eye's Back, Selfstudier, Crouch, Swale, Visviva, BilledMammal, Hut 8.5, Bkonrad, Newimpartial, Hike395, Donald Albury, Viridiscalculus, Mxn, Avilich, Dlthewave, Choess, Joe Roe, Ezhiki, Rschen7754, Atlantic306, XOR'easter, and Imzadi1979:. This change was prompted by discussions involving sets of mass-created articles about Geographical entities based primarily or solely on database sources including:

Survey (Changes to GEOLAND)

Thebiguglyalien explained it better BilledMammal (talk) 20:29, 15 September 2023 (UTC)
The following discussion has been closed. Please do not modify it.
  • My understanding is that you are opposing this because you believe we should include this content in lists, and you believe that through NOPAGE the current wording permits this. I disagree with this, because NOPAGE is not relevant to deciding whether a topic is notable; instead, such lists are permitted through NLIST. If this is incorrect, please correct me. BilledMammal (talk) 20:18, 15 September 2023 (UTC)
    I think:
    • Whether a populated place is covered solely through statistics (extremely broadly construed concept, for populated places) or not is something where being able to argue about it for ages, rather than shrug and move on to the next article, is not a net positive to the project or its encyclopedic coverage (there are multiple subsets of this one; whether people can easily find non-statistical sources depends on factors other than their existence, 'statistics-only' for a populated place is a concept such that it is not in all circumstances incompatible with a valuable article, etc)
    • The only reason we care about this is because mass creation burns the commons/ruins everything, by producing a vast number of articles that don't have reference work value
    • But "large number of stubs that won't realistically be expanded" is solvable through routes other than changing notability, because they can be upmerged, because notability is specifically "below this you can't have an article" rather than "above this you must have an article"
    • Accordingly, the specific problem here is solvable through routes other than "raising the minimum threshold under which it's impossible to ever have an article"
    Vaticidalprophet 20:27, 15 September 2023 (UTC)
Thryduulf I've switched to my proposed Option 3 then, which I think is stronger than the others..♦ Dr. Blofeld 13:00, 22 September 2023 (UTC)

Discussion (Changes to GEOLAND)

Extended discussion and responses here please... FOARP (talk) 16:25, 15 September 2023 (UTC)

So it looks like FOARP brings this up every year because they clearly don't like the idea that Wikipedia exists in part to detail places where people live. A couple years ago there was a unilateral change which was then brought to the larger community and thoroughly rejected. This would have the same effect - by making NGEO slightly more difficult to apply at AfD, we would tighten our notability guidelines and lose one of our purposes as an encyclopedia.
The The idea wikipedia is a gazetteer was boldly added by Unitedstatesian in 2008 and reflected how NGEO worked at AfD (this was later made a guideline.) Even though it was boldly added, its place was further confirmed in discussion in 2009. In reality, right now, the NGEO test is very easy: if we can prove people live or lived in a place, it's eligible for a stand-alone article, similar to how gazetteers work. This would require us to be able to say something about the place, which would instantly put a number of places at risk of deletion, especially in non-English countries or in places where you can't just web search for a village.
The reason is clear: even though we may not be able to say anything about a verified populated place now does not mean we'll never be able to: the fact people live in a locale means there's something notable about that locale, even if there's just an oral history and it's not on the internet yet. A lot of the deletion discussions I've been a part of have been about verifiability, for instance the American GNIS stubs, and I can see a change leading to a lot of negative outcomes where we would delete verifiable places without easily accessible online information. SportingFlyer T·C 21:41, 15 September 2023 (UTC)
SportingFlyer - You said you were going to find a source to support your assertion that Wikipedia is a gazetteer. None of these links do so. Even WP:5P (an essay) only states that it includes elements of gazetteers. FOARP (talk) 14:49, 16 September 2023 (UTC)
It's in there. It was boldly added to reflect geography policy at early AfDs and then was further accepted by the community through discussion. SportingFlyer T·C 21:31, 16 September 2023 (UTC)
It's in where? A simple quote will do. FOARP (talk) 12:47, 17 September 2023 (UTC)
Again it goes all the way back to the first edit of NGEO, and the concept was endorsed as much a little under two years ago: [7] SportingFlyer T·C 09:45, 18 September 2023 (UTC)
... which only ever said that Wikipedia includes "combines many features of ... gazetteers", not that it is a gazetteer. FOARP (talk) 10:27, 18 September 2023 (UTC)
However given how many people in this and related discussions base their argument on Wikipedia being/including a gazetteer I'm not convinced that the distinction you are making is one that is supported by community consensus. When it comes to the inclusion of information about populated and formerly populated places, I don't think the distinction is either meaningful or useful. Thryduulf (talk) 14:20, 18 September 2023 (UTC)
I think people should recognise this “Wikipedia is a gazetteer” argument for what it would logically require: that Wikipedia give the same level of coverage to geographical features that gazetteers do. This means extending our coverage to every single named feature, regardless of notability or whether there is anything verifiable to write about it beyond statistics. FOARP (talk) 03:25, 22 September 2023 (UTC)
We should have coverage of every populated place we can verify. That doesn't necessarily mean a standalone article, but nothing in current policy or guidelines requires that so it's not a problem. Whether we have a similar coverage of every named feature, if that's what the community wants then I see no issue with that. Thryduulf (talk) 08:25, 22 September 2023 (UTC)
"We should have coverage of every populated place we can verify" - both my house and the street it is located on are verifiable and populated. They even have "legal recognition" in the sense that they are listed in the official census/gazetteer. FOARP (talk) 09:34, 25 September 2023 (UTC)
Okay, replace the word "place" with the word "settlement". Although I would not be opposed to verifiable "list of streets in X" articles, with links to notable streets and/or notable buildings on that street.Thryduulf (talk) 10:39, 25 September 2023 (UTC)
We should have coverage of every populated place we can verify. We already do; what we don't have is coverage of every populalated place on the most granular level, but why should we? What benefit would this level of granular and indiscriminate coverage bring? BilledMammal (talk) 10:23, 25 September 2023 (UTC)
I didn't say anything about indiscriminate coverage or excessively granular detail, so you are attacking a straw man. The current policies allow for the appropriate level of coverage in the appropriate place, which is sometimes a standalone article and sometimes inclusion on a broader article. The proposal would preferentially delete rather than merge content, especially where English-language sources are not trivially available on Google increasing systematic bias. Thryduulf (talk) 10:37, 25 September 2023 (UTC)
You said We should have coverage of every populated place we can verify. The implication was that you won't be satisfied with coverage of populated places at anything but a very granular level - or indiscriminate - level. Am I incorrect? BilledMammal (talk) 10:56, 25 September 2023 (UTC)
Yes you are incorrect, "coverage" is not a synonym of "indiscriminate or excessively granular coverage", nor is it a synonym of "every populated place must have an individual article" or any of the other straw men being argued against. "Coverage" means "we should have information about the subject on an appropriate article", with the most appropriate article depending on the situation. Thryduulf (talk) 11:07, 25 September 2023 (UTC)
Yeah, I think the primary result of this would be "more uncertainty" rather than "better project". "Statistics-only" for a populated place is extremely broad; it's possible to write a significant amount about a place based solely on its census stats. I could pull out any neighbourhood in my city and assemble its census stats into a legitimate-reference-work-addition. In fact, just to test my point, I've done that and massively expanded Clayton, Victoria#Demographics. There's a lot more to the article than this section -- but if you replaced the whole article with that section and a lead saying Clayton is a suburb in Melbourne, Victoria, Australia, 19 km south-east of Melbourne's Central Business District. It is the home of Monash University, you would have a substantially more explanatory article than the one that currently exists. I could write more there, but I very consciously restricted myself only to census results and to one article based off census results (because its secondary analysis drew a comparison the results themselves don't). The problem we have right now is that there are mass-created articles for which it's unrealistic to assume mass expansion, which is a different problem to what you can write based on something. Vaticidalprophet 22:36, 15 September 2023 (UTC)
Purer example: I remembered Salisbury East, South Australia being a stub. Excitingly, it was even an uncited stub! It's not now. I think this tells you a fair bit about the place. Vaticidalprophet 23:03, 15 September 2023 (UTC)
The process was not started by FOARP, but my me. If you had looked at the actual discussion instead of just reading the blarb at the top, the discussion was started on the basis of the actual wording of the current GEOLAND. Legal recognised place is virtually impossible to actually evidence in large parts of the world. This is why the phrase has been changed to Settlements and Administrative Regions. From that discussion the issue of permastubs came up. Now I don't have a problem with stubs if they are accurate, but census data being the only evidence to create an article is not really good enough for an encyclopedia. Glasgow University History of the British Census says "It is essential to bear in mind when examining census data that the census it is not a completely accurate record." And that is in a western country where census data is seemed to be accurate, as pointed out in the original discussions how many census in parts of the world give a name to a nomad village (such as Abadi) which has disappeared by the next census and a new one appears? As pointed out by Vaticidalprophet, he took an article which was just statistics and expanded hugely, so it is possible in a large number of cases. However sometimes you can't. Hare Green, a hamlet in Essex is a perfect example. It was just evidence by a map and census data. I have filled out with all that I can find to expand it - some historical houses and some info about the roads. None of it would meet GNG. To be honest I live in Essex and had not heard of it, and if it was redirected to Great Bromley and added there we would not lose the information.Davidstewartharvey (talk) 06:33, 16 September 2023 (UTC)
Hare Green doesn't seem a great example to me. There are two 17th-century listed buildings, suggesting a long history which will be covered in (offline) local history books. The population is over 700, which is a small village, not really a hamlet. The settlement will appear in local newspapers, probably for hundreds of years, and certainly in multiple historical directories, which will probably turn out to include material on local public houses and businesses. 'I've never heard of it' is never a reason for deletion.
When academic sources state that the historical UK census is not completely accurate, what's generally meant is that old handwritten records can be difficult to decipher and people gave their names in different forms, so tracing individuals can be tricky, not that a whole village was incorrectly entered! Espresso Addict (talk) 07:31, 16 September 2023 (UTC)
I am not saying something shouldn't exist cos i have no heard of it! I have done more than internet searches - as I said I am local and found very little beyond architecture. Because something has existed doesn't mean anything interesting exists and needs an article about it based on that it just exists. If the information was added to the Great Bentley page and the page redirected there, we are not losing anything as there is not much to lose.
The Glasgow University comment is on the British Census, but it counts for census' around the world. How many people put down Jedi as their religion? And although you maybe bound by law to complete the census in parts of the world, they are actually not considered legal documents to confirm settlements actually legally exist in many parts of the world either.Davidstewartharvey (talk) 09:32, 16 September 2023 (UTC)
@ Davidstewartharvey: Hare Green satisfies GNG and, on account of the listed buildings, GEOFEAT. Your expansion is far from exhaustive. You have said virtually nothing about the two listed buildings, despite the fact that there is more coverage. You have not even mentioned the ancient earthworks: [8]. No apparent use of the coverage in GBooks, GScholar and the Internet Archive. You have made little use of the newspaper articles in GNews, which contain more information than you have added. You have not added any of the newspaper articles in the British Newspaper Archive [9] [10] [11], or in Newspapers.com [12] [13]. I don't see any use (beyond what is included in GNews) of the websites of the essex newspapers like this one: [14]. (GNews only contains articles from the last few years, whereas newspaper websites often go back to the 1990s). More importantly, Great Bromley itself certainly does satisfy GNG by a wide margin, but would be in real danger of an AfD or draftification under this proposal, because the sources are not in the article yet. I can put up with a place like Hare Green being merged, but I can't put up with a place like Great Bromley being mistakenly deleted or draftified, which would be the probable outcome of this proposal.
As for the census: There is no reason to assume that any reliable source is completely accurate. It would not surprising if every reliable source in the world contains errors. If we rejected every source that contained at least one error, it would not be surprising if we end up with no sources at all. I happen to be aware that sources say that even the New York Times contains some errors (such as in its coverage of the Russian Revolution) and is therefore not completely accurate. James500 (talk) 10:11, 16 September 2023 (UTC)
How does the current article meet GNG? Firstly maps can't be used to define notability, the census data is not from ONS but a database site based in Germany - very questionable. The road references - one a local newspaper and the other's primary source as it is the official legislation. The references on BNA - sales of properties and livestock, a barn fire with a death, are these really enough to establish GNG? And as to GEOFEAT, I remember that there was a discussion a few years back that agreed a concensus that Grade II properties are were not accepted as notable based just on a listing. This was based on the fact there is 343,004 Grade II listed along in England. In addition the Henge is not in Hare's Green but in Little Bentley.Davidstewartharvey (talk) 15:47, 16 September 2023 (UTC)
I read that discussion about GEOFEAT. IIRC, it was not an RfC, established no consensus, either to change the wording of the guideline or otherwise, and it was based on a pseudohistory argument that because the population of the UK is smaller than that of the USA today, the UK should have fewer listed buildings than the USA. In fact, buildings are not normally listed in the UK unless they were built before 1840, and are not automatically listed unless they were built before 1700. The UK had a much larger population than the USA before 1840, and a vastly larger population before 1700. The UK has a much larger number of pre-1840 buildings, and a vastly larger number pre-1700 buildings, than the USA. Therefore the UK should have a much larger number of listed buildings than the USA, because the UK has more historic buildings than the USA, and the UK's historic buildings are generally more important than those of the USA. The reality is that the NRHP largely consists of late 19th and 20th century buildings that would never get listed in the UK in a million years, because their historical importance is very low. The number of grade II listed buildings in the UK is reasonable for a country that has a massive concentration of medieval and early modern buildings of immense historical importance, something that does not exist in the USA, which has no medieval buildings and almost no early modern buildings. James500 (talk) 00:11, 17 September 2023 (UTC)
BAR British Series 175 says that the penannular ditch is in "Hare Green, Great Bromley". Unless you have another source, it is in Hare Green. James500 (talk) 00:51, 17 September 2023 (UTC)
there are two henges one in Great Bromley map ref 51.860498N Longitude: 1.054156E [1] and one in Little Bentley map ref Latitude: 51.884112 Longitude: 1.062716 [2] Other henges in the Tendring area is at Great Wigborough Latitude: 51.7942 / 51°47'38"N
Longitude: 0.8369 / 0°50'12"E, Tye Henge in Lawford Latitude: 51.936516N Longitude: 1.036039E, though the closest to Hare Green is Little Bromley Latitude: 51.905055N Longitude: 1.035369E. The book is the only reference to a Henge in Hare Green. Davidstewartharvey (talk) 16:56, 17 September 2023 (UTC)
And further more upon checking the website https://www.citypopulation.de/en/uk/eastofengland/ which is the source of the population data, it quotes states Source: UK Office for National Statistics (web).
Explanation: All population figures and depicted boundaries are based on output areas officially assigned to the 2022 built-up areas. Output areas often include some unbuilt parts. However, tabulated area figures refer to (typically smaller) actual built-up areas in order to present a more realistic population density. Some of the older population figures are approximate values. Well when you check the data that is currently on the site (supposedly 2021 data) it states Hare Green has a population of 678. However Great Bromley is not in there list, and when you check the data on the ONS [3] it says the parish of Great Bromley has 1100 people, with no further breakdown available. So the reference is actually duff!Davidstewartharvey (talk) 17:45, 17 September 2023 (UTC) Davidstewartharvey (talk) 17:45, 17 September 2023 (UTC)
Trivial mentions do not count towards GNG. This is very explicitly stated. Hare Green being mentioned once as a location in local news stories (failing NOTNEWS) is not detailed, direct coverage of Hare Green. It is indirect and incidental. List entries for individual historic structures that happen to be in Hare Green are also not direct significant coverage of Hare Green. And literally no one would be trying to get Great Bromley deleted with this proposal, what a ridiculous leap. JoelleJay (talk) 21:32, 16 September 2023 (UTC)
Your argument is that a village cannot be notable for significant coverage of its history or its buildings. In other words, you are arguing that a village can never be notable, since there is no other coverage that a village could receive (unless you want coverage of the quality of its subsoil). I do not think that your argument is within the letter, let alone the spirit, of GNG or NGEO. And all that your argument could achieve, at most, is a page move to something like History and buildings of Hare Green, which will put you in violation of WP:COMMONNAME. James500 (talk) 02:01, 17 September 2023 (UTC)
If a building that happens to be in Hare Green receives SIGCOV that doesn't actually discuss Hare Green itself it is not coverage of Hare Green. This should be exceptionally clear from our P&Gs. And the buildings in question don't even have SIGCOV themselves, they just have a context-less, nearly-prose-less description of the building and items inside.

TM 02 SE GREAT BROMLEY HARWICH ROAD HARE GREEN (north side)
House. C17 with later rear extension. Timber framed and rough rendered. Red plain tiled roof. Central red brick chimney stack. 2 storeys. 2 window range of small paned vertically sliding sashes. Central 2 panel 2 light door, simple surround. Interior features include vertically boarded and panelled doors, some with HL hinges. Inglenook fireplace with cast iron fireback reading "1640", "Fairfax" and depicting a man on horseback. Iron meathooks to rear kitchen ceiling beams.

If you seriously think that is direct significant coverage of Hare Green then I seriously question your understanding of our policies. JoelleJay (talk) 03:18, 17 September 2023 (UTC)
(1) You have not answered what I said to you at all. You have instead twisted my words and put words into my mouth.
(2) WP:SIGCOV does not contain a definition of the word "directly" (other than "so that no original research is needed to extract the content") and it is not acceptable for you to claim that your personal opinion about how that word should be interpretated is community consensus unless you can produce evidence of that alleged consenus. Could you, for example, provide me with a link to an RfC on the meaning of the word "directly" in WP:SIGCOV which confirms that your interpretation of that word actually is supported by community consensus?
(3) At this point, I really suspect the fact that you are still making comments like "nearly-prose-less" tells me everything I need to know about the level of community support for your opinions. I happen to remember that your theory (and it is still only a theory) that significant coverage must consist of what you refer to as "prose" was discussed during WP:ACAS. The community pointed out to you that GNG contains no such requirement, and your personal interpretation that GNG requires what you refer to as "prose" was totally rejected by community consensus. Your proposal in Q14 was simply rejected out of hand by the community.
(4) If you don't stop making unevidenced claims about alleged community consensus about the interpretation of particular words in guidelines, I could put a stop to that by starting an RfC myself on the meaning of "directly", to find out what the community really thinks. James500 (talk) 05:35, 17 September 2023 (UTC)
1. I'm the one twisting and putting words in your mouth!? You literally said Your argument is that a village cannot be notable for significant coverage of its history or its buildings. In other words, you are arguing that a village can never be notable, since there is no other coverage that a village could receive (unless you want coverage of the quality of its subsoil). That is either an intentionally outrageous mischaracterization of what I said or your reading comprehension is unsuitable for this project. I didn't make a single statement that claimed to restate your arguments whatsoever; the closest would be saying If you seriously think... which is very clearly not saying "You seriously think".
2. What do you believe "directly" in "Significant coverage" addresses the topic directly and in detail means if it doesn't mean the topic must be discussed directly! Do you believe that a source's SIGCOV is distributed to all topics that are named within it?? A source on a movie that trivially names who its director was before going into detail that has nothing to do with the director's role is not SIGCOV of the director, obviously.
3. I don't know how you are reading that the community "totally rejected" the interpretation that secondary coverage should be in prose from a proposal in the very specific context of limitations on acceptable sources for mass creation that the closers said had no discernible consensus. Most of the opposes were against "disallowing mass creation of articles that are based on any database" or were somehow misinterpreting the proposal as asserting that a database isn't RS, rather than making any statement on the prose-non-prose database distinction.
4. You are more than welcome to ask the community whether "directly and in detail" includes all coverage adjacent to a trivial mention of a topic regardless of whether any coverage is actually specifically about the topic. JoelleJay (talk) 02:50, 18 September 2023 (UTC)
It does not matter whether you expressly claimed to restate my words or not (and the words "If you seriously think" and "This should be exceptionally clear" were close enough), if you answered my comment in a way that clearly implied that I had said that the coverage was "direct significant coverage of Hare Green" or clearly implied that I had said that coverage of "a building that happens to be in Hare Green" discusses "Hare Green itself" and "is . . . coverage of Hare Green". If I ask what kind of coverage a village can receive other than coverage of its history and buildings, I should not get a response that implies that I have argued that such coverage satisfies GNG, when I did not comment on whether that coverage satisfies GNG at all in that particular post. I am prepared to accept that you did not realise what you were implying, because I am also starting to have doubts about your literacy and, in particular, whether you understand that you can imply something by means of juxtaposition. (If you put your comment beneath my comment, the juxtaposition can, in of itself, imply that your comment is talking about my comment). That said, I really think it would be better if we both stop questioning each others' literacy and just walk away from this conversation, because it is clear that neither of us wants to talk to the other. Goodbye. James500 (talk) 03:37, 18 September 2023 (UTC)
You asserted Your argument is that a village cannot be notable for significant coverage of its history or its buildings. In other words, you are arguing that a village can never be notable, since there is no other coverage that a village could receive. 1. I did not say anything close to that. I said passing mentions in primary news do not count and existence of listed buildings with zero coverage of the location do not count. 2. there is no other coverage that a village could receive is a claim you are making about what kind of coverage exists for villages. Since my statement only applied to the latter of your two options of "coverage of its history" and "coverage of its buildings" (I never said coverage of a town's history doesn't count, unless you are suggesting that primary trivial newspaper mentions qualify?!), your comment was read as claiming the coverage of the buildings in this particular case is sufficient for notability. So I responded with a rebuttal of this claim. JoelleJay (talk) 20:47, 18 September 2023 (UTC)
When I looked at the talk pages of the notability guidelines, I saw the discussion on WT:N that you are obviously referring to. I should point out that a discussion between three people (yourself, Masem and Whatamidoing) does not create, and is not evidence of, any kind of community consensus. Especially when at least three people (myself, Expresso Addict and Whatamidoing) now object to your interpretation. In short, there is no consensus for your interpretation of GNG. James500 (talk) 23:51, 17 September 2023 (UTC)
How is the mere existence of old buildings enough to assume a location has had SIGCOV? If these sorts of things are essentially conferring notability then that is a serious source of systemic bias since obviously that's only going to work for the places where society has been stable enough that random buildings have survived and there have been modern efforts to document and preserve them and there is a presumption of a written historical record providing significant primary coverage that has itself received secondary analysis (because we certainly cannot be using 17th century sources as the bases for any articles ever). JoelleJay (talk) 03:35, 17 September 2023 (UTC)
[E]ven though we may not be able to say anything about a verified populated place now does not mean we'll never be able to, and even though we will eventually be able to say more about these populated places when interested editors decide to add content, that does not mean we can't upmerge everything shorter than (say) four sentences into a list article now, leaving behind redirects for future expansion. Folly Mox (talk) 15:39, 16 September 2023 (UTC)
@SportingFlyer: comment on content, not contributers. Edward-Woodrow :) [talk] 18:48, 16 September 2023 (UTC)
I haven't personally attacked anyone. Thank you. SportingFlyer T·C 21:33, 16 September 2023 (UTC)
I didn't say that. I reminded you not to comment on contributors. Saying that it looks like FOARP brings this up every year because they clearly don't like the idea that Wikipedia exists in part to detail places where people live. A couple years ago there was a unilateral change which was then brought to the larger community and thoroughly rejected. This is not a personal attack, but it is unnecessary commentary on a fellow editor. Edward-Woodrow :) [talk] 22:38, 16 September 2023 (UTC)
They have consistently tried to make this change over the past few years, without gaining any consensus for it. It's getting tiring. Not sure why that's not allowed to be said. SportingFlyer T·C 12:38, 17 September 2023 (UTC)
@SportingFlyer - You're basically stating that this is my proposal: it was not, very little of the proposed amendment was from me. I linked the discussion where the proposed amendment was developed, and named the editors who contributed to that discussion. "it's getting tiring" - two vaguely-related discussions on the subject in three years tires you? "without gaining any consensus for it." - the proposed edit is the result of a consensus.
We can see from other discussions (NSPORTS, SCHOOLOUTCOMES, NASTRO) that previously over-broad notability standards can be reined in. Proposing the same be done in other areas is just part of the healthy give-and-take of policy on Wikipedia. FOARP (talk) 07:22, 18 September 2023 (UTC)

I'm intrigued by the proposed ten-per-day rate limit on AfD. If I understand correctly, some here fear mass creation of geographical microstubs. So why not put a rate limit on that instead? That could dampen the motivation for someone to run up their new article count by harnessing a poorly understood database. You don't want the waitstaff to take orders faster than the kitchen can fill them, but you also don't want the waitstaff to clear the tables faster than the diners can finish their meals! :^D – Minh Nguyễn 💬 01:56, 16 September 2023 (UTC)

There was a discussion on the subject that got derailed by disagreement over the borderlands of what constitutes mass creation. IMO the obvious thing to ban is automated mass creation -- there are some arguments about certain other forms of high-volume article creation, but there's no number that a human can reach that people agree is problematic (and really shouldn't be). Discussion didn't establish scope, though, so it got bogged down in those, and until then we retain all these everything-not-mandatory-is-forbidden arguments. Vaticidalprophet 02:15, 16 September 2023 (UTC)
” there's no number that a human can reach that people agree is problematic (and really shouldn't be).”
One of the still-operating mass-creators in the GEOLAND field pumps out articles at a rate of up to 330 articles a day based in Russian census data. I can’t conceive of any situation in which that wouldn’t be problematic. FOARP (talk) 06:00, 28 September 2023 (UTC)
I think some kind of forced limitation (maybe using an edit filter) of five articles per day, and no more than 100 articles per month, would slow down any mass-creations like this. Edward-Woodrow :) [talk] 18:52, 16 September 2023 (UTC)
It's certainly possible to legitimately create articles past that rate, though the day one moreso than the month. The problematic sort of mass creation is well beyond what can be done by an actual person; there are people who take an unusually wide view of what mass creation is problematic, but the actual motte of "mass creation produces data integrity and sometimes sensitive-topic-maintenance problems" is implausible for anything human-doable. Vaticidalprophet 20:53, 16 September 2023 (UTC)

I concur with Vaticidalprophet: "two sources = must have an article"-type thinking is a major problem. Mass creation can only be addressed directly, not through changes to notability guidelines. It merits a trip to ANI for WP:NOTHERE. I've thought a good bit about mass creation, and I don't think there's a realistic way to address it except requiring that notability be demonstrated inline rather than "presumed". I don't think automated creation is the real problem. And it'll mean that we don't have microstubs that are useless verbatim reprints of external sources, whether database entries or non-SIGCOV passing mentions. DFlhb (talk) 10:07, 16 September 2023 (UTC)

Please explain it to me like I'm an idiot: why is people writing lots of encyclopaedia articles a problem that we need to address at all? I thought that was kind of the point of this place? – Joe (talk) 14:00, 16 September 2023 (UTC)
Writing lots of articles isn't a problem; writing lots of low quality articles is. I have been considering proposing expanding WP:SPORTSCRIT #5 to all topics; it would effectively prevent mass creation, as the work required to identify at least one source makes it non-viable, and creates a (very low, but extant) minimum quality requirement. BilledMammal (talk) 14:12, 16 September 2023 (UTC)
So the kind of stub discouraged in the proposed guideline (Village X is in County Y, Province Z of Country A, at coordinates C, with a population of P.) is low quality? Why? – Joe (talk) 14:19, 16 September 2023 (UTC)
Because it replicates database-style content, like taxon sub-stubs. Edward-Woodrow :) [talk] 14:36, 16 September 2023 (UTC)
And because it is likely erroneous. We saw that with the US "village" articles based solely on GNIS. We saw that in the Iranian "village" case. We saw that with Dr Blofeld's Indian "villages". We saw that with Lugnuts Turkish "villages". These database sources just aren't made for the purpose that some editors on Wikipedia think they are made for. FOARP (talk) 14:46, 16 September 2023 (UTC)
That's a reason to designate those sources as unreliable or at least unsuitable for certain purposes. I don't see how it's a reason to rule out that whole type of article. XOR'easter (talk) 16:42, 16 September 2023 (UTC)
We’ve already ruled out two of the world’s largest and most heavily-used Gazetteers (GNIS and GEONet Names Server) for certain information. We also ruled out a major class of entity in a census (abadi in the Iranian one) and are currently discussing doing the same with the Russian one.
I’m not sure how many times this needs to be done before we start to look at whether the rule by which every item on these directories warranted an article was actually a helpful one. FOARP (talk) 20:48, 2 October 2023 (UTC)
Except the rule doesn't allow every item on those directories to have an article. The current requirement is that two criteria are met:
  1. The subject is a populated place
  2. The subject has legal recognition
The abadis, railway junctions, etc. fail point 1. The current rule also doesn't require that every place that does meet both criteria has it's own article. Thryduulf (talk) 20:53, 2 October 2023 (UTC)
I'm just going to point out that if a rule is continuously causing issues then there is a problem with the rule that needs to be addressed, even if the issues are caused the rule being misunderstood or misapplied. BilledMammal (talk) 21:58, 2 October 2023 (UTC)
Except that if the instances where the rule is continuously causing issues is when it is being misapplied in violation of another community norm (in this case, the one against bot-like article creation), then it is unclear to me that the problem is best understood as belonging to the rule itself. Newimpartial (talk) 22:05, 2 October 2023 (UTC)
@Edward-Woodrow: WP:NOTDATABASE is a shortcut to our policy Wikipedia is not an indiscriminate collection of information, which in brief states that data should be put in context with explanations referenced to independent sources. I can't see how stubs like this contradict that policy? If anything, a stub is rather the opposite of an "indiscriminate collection of information". – Joe (talk) 16:01, 16 September 2023 (UTC)
Government censuses are not independent secondary sources. JoelleJay (talk) 21:35, 16 September 2023 (UTC)
Independent secondary sources are only a requirement of GNG. Verification is the requirement of NGEO, and government censuses do verify the existence of a populated place (yes, with spot exceptions based on the country). SportingFlyer T·C 09:12, 18 September 2023 (UTC)
Secondary sources are a requirement for NOR. JoelleJay (talk) 17:27, 25 September 2023 (UTC)
This is not correct. Browse Wikipedia talk:No original research for numerous discussions of this. Jc3s5h (talk) 18:41, 25 September 2023 (UTC)
It has recently become clear to me that JoelleJay and BilledMammal read WP:NOR as enshrining a requirement that secondary sources be included in all articles, whether or not those articles offer synthetic claims (which of course, if they do, secondary sources are required) and regardless of whether or not they derive their claim to Notability from GNG (which encourages secondary sourcing). I have tried to point out that this interpretation of NOR as documenting a high-level consensus that supercedes WP:N is not a generally-held community view, but to date I don't feel that either editor has really heard those comments as constructive. Newimpartial (talk) 21:41, 25 September 2023 (UTC)
@Jc3s5h, NOR explicitly says articles cannot be based on primary sources and that non-primary sources are needed to establish notability (and not just through GNG). How could an article based only on primary sources comply with those policies? I'm not saying that primary sources can never be used. JoelleJay (talk) 00:10, 26 September 2023 (UTC)
Incorrect, read what WP:NOR actually says and you'll see that primary sources are perfectly acceptable for the verification of facts. All analyses and interpretive or synthetic claims about primary sources must be referenced to a secondary or tertiary source and must not be an original analysis of the primary-source material by Wikipedia editors. but when there is no verification or interpretation happening a primary sources is fine. Thryduulf (talk) 18:42, 25 September 2023 (UTC)
What WP:OR says in that primary sources can be used in an article, but for an article to exist secondary or tertiary sources must also be used; Do not base an entire article on primary sources and Secondary or tertiary sources are needed to establish the topic's notability and avoid novel interpretations of primary sources.. BilledMammal (talk) 19:20, 25 September 2023 (UTC)
Secondary or tertiary sources are needed to establish the topic's notability You will note that this is talking about notability, not verifiability. Thryduulf (talk) 19:34, 25 September 2023 (UTC)
The full line is Secondary or tertiary sources are needed to establish the topic's notability and avoid novel interpretations of primary sources. And you'll note that this discussion is about when to have an article, not just verifiability - and I'm genuinely confused why you thought it was limited in that way. BilledMammal (talk) 19:56, 25 September 2023 (UTC)
This was a response to @JoelleJay asserting that Secondary sources are a requirement for NOR, which they are not. As for the full quote, I addressed that in my previous comment when there is no verification or interpretation happening a primary source[] is fine. Thryduulf (talk) 20:31, 25 September 2023 (UTC)
And to comply with our policies on origional research, articles must have secondary sources per the quotes I provided. JoelleJay's statement appears to be correct? BilledMammal (talk) 20:38, 25 September 2023 (UTC)
NOR could be read to require some secondary sources but it does not disallow an article that has a mixture of primary and secondary sources. JoelleJay's statement can be read as a claim that NOR entirely disallows primary sources. And the phrase "no verification" in Thryduulf's post makes no sense to me; it does not violate NOR to provide two citations that support the same claim, one of which is secondary and the other primary. Jc3s5h (talk) 20:53, 25 September 2023 (UTC)
Reading my comment again, I think I meant to write "synthesis" rather than "verification". An article consisting of entirely primary sources with no synthesis or interpretation of those sources is entirely compatible with NOR. In context, I read JoelleJay's statement as saying that NOR requires all articles to have secondary sources, which it does not. GNG requires secondary sources to exist, but doesn't require them to be present in the article (or even easy to find). Thryduulf (talk) 21:19, 25 September 2023 (UTC)
What, should I have said "non-primary" instead of "secondary"? If an article cannot be based on primary sources, and shouldn't even have large passages based on them, then of course non-primary sources are required by NOR? And if non-primary sources are needed to establish notability, then how can an article that has zero non-primary sources be established as notable? JoelleJay (talk) 23:26, 25 September 2023 (UTC)
@FOARP: "Likely"? Really? Do you have any data to back that up? AFAIK the errors made by the editors you cite were a small fraction of even their own output, never mind the hundreds of thousands of similar articles created by editors who have never made any waves. – Joe (talk) 16:04, 16 September 2023 (UTC)
The errors in GNIS and GEONet Names Server are par for the course - and discussed at length in the links in the background section. FOARP (talk) 21:18, 16 September 2023 (UTC)
Yes, and those two databases are specifically excluded by WP:GEOLAND (but not your proposed new version). – Joe (talk) 13:01, 20 September 2023 (UTC)
Those sources are already addressed at WP:RSP; while it's probably a good idea to keep the restriction in NGEO as editors creating articles based on it sometimes forget that other policies exist and apply, removing it won't permit editors to create articles based on them. BilledMammal (talk) 13:06, 20 September 2023 (UTC)
Because it is absent of context. What other villages exist in County Y, Province Z? How are they distributed spatially? Is P around the average population of this set, or an outlier? If I have to navigate to a different article to understand basic questions like these, I'd feel comfortable characterising such a stub as low quality. How is it high quality? Folly Mox (talk) 15:32, 16 September 2023 (UTC)
We're discussing notability, not quality, and one of the important reasons why you might have a stand-alone article for town X in county Y in province Z even though there's nothing else written in the article is because you may be able to link it to other articles in other Wikipedias where you may be able to find more information. I just looked up a place in a foreign language Wikipedia that doesn't have an English language article - there should be nothing wrong with a little stub in English on that particular place. SportingFlyer T·C 08:39, 17 September 2023 (UTC)
Our notability guidelines shouldn't be encouraging the creation of low quality articles; they particularly shouldn't be encouraging the creation of articles that are so low quality that they violate policies like WP:NOT and WP:OR.
Our purpose isn't to be a directory to foreign language articles. BilledMammal (talk) 09:03, 17 September 2023 (UTC)
What in that policy actually rules out being a directory of foreign-language articles? It says that Wikipedia is not a directory of everything in the universe that exists or has existed; the set of things that have Wikipedia articles in other languages is obviously more restricted than that. None of the specific no-no's (genealogy for the sake of genealogy, product pricing, etc.) either apply directly or are analogous enough to be applicable. I know that Wikipedias in different languages have different inclusion standards, and just because the Ruritanian WP includes an article doesn't mean we have to have an exact counterpart, but geographic locations are a case where parallel pages do make a considerable amount of sense. At the very least, WP:NOTDIRECTORY isn't a knock-down argument against it. XOR'easter (talk) 22:20, 17 September 2023 (UTC)
Wikipedia articles are not simple listings without contextual information showing encyclopedic merit. If the only justification for an article like this to exist is to link to external resources, then its existence violates WP:NOTDIRECTORY as it is nothing more than a simple listing without contextual information showing encyclopedic merit. BilledMammal (talk) 22:35, 17 September 2023 (UTC)
With NGEO though we've decided through long standing editing practices that the entire set of populated places is notable. The NOTDIRECTORY part you quote links to LISTCRIT which says selection criteria should be unambiguous and supported by reliable sources. While this isn't a list, Giddawa qualifies for both of those things. Unlike a cricketer of questionable notability, it's easily verifiable, and we're not improving the encyclopaedia by deleting it. SportingFlyer T·C 09:22, 18 September 2023 (UTC)
I think you misunderstand the point I'm making; I'm merely saying that links to foreign language articles isn't a justification to keep articles like the one I linked, per WP:NOTDIRECTORY, not that there aren't other reasons to keep it. BilledMammal (talk) 09:26, 18 September 2023 (UTC)
@SportingFlyer - “we've decided through long standing editing practices that the entire set of populated places is notable.” - This simply isn’t true. There have been large-scale deletions and redirection of verifiable “populated places”, because those “populated places” turned out to be statistical artefacts (eg the Iranian Abadi articles about pumps, bridges, factories, farms etc.). The GEOLAND standard requires “legal recognition” and doesn’t apply to every populated place - the problem is this is still over-broad and requires different coverage from one country to another depending on differing legal systems. FOARP (talk) 06:13, 28 September 2023 (UTC)
Joe, I've seen you around, you're no idiot. Anyway: my motivation is that obscure articles lack scrutiny, so the risk of inaccuracies is too high. Contributions aren't vetted, a-la Britannica. Given our natural lack of formal review, and our dearth of contributors, I think maintainability is a prime concern. We're the 7th most visited website, and false info entails real-world harm, including citogenesis, which makes inaccuracies very hard to fix.
According to WP:STUB, stubs should be "capable of expansion". I'm not trying to make an appeal-to-authority ("appeal-to-guideline"?), but it raises interesting questions: when people mass-create articles, do they take into account expansion potential? If not, are non-expandable stubs desirable? And given that non-notable articles need to be deleted eventually, can't we avoid their creation in the first place, saving the time of both the creator and the AfD-nominator?
Months ago BilledMammal showed that stubs are rarely improved by anyone other than their creator. So our long-held assumption, that stubs are a way to incubate articles until someone improves them, and that "stubiness" is temporary, is no longer true for many stubs. Wikipedia's changed, and the low-hanging fruits are gone; why shouldn't our practices adapt? DFlhb (talk) 16:04, 16 September 2023 (UTC)
Thanks for explaining. I asked you to treat me like an idiot because reading statements like yours (you're far from the only one), I genuinely feel like I am missing some fundamental piece of the puzzle. It simply does not make sense to me that creating stubs can be a problem in and of itself, and yet many people seem to state this as if it is self-evident. Rather than assuming that you're stupid (or I am), I assume there's a more basic misalignment of views underlying this disagreement.
Reading your response, it definitely seems like a big factor here is whether or not you feel like Wikipedia is in a consolidation phase, struggles with maintainability, and/or cannot tolerate more 'potential articles'. I've been around here a while, created a few dozen articles, and have a list of ones I'd like to create long enough that I'll never realistically finish it, so I'm a bit more optimistic on that score. Unfortunately it does sound more like a difference of perspective than something we'll come to an agreement on. – Joe (talk) 16:38, 16 September 2023 (UTC)
Right; it's a difference in views/approach, not something that can be easily worked out. And maybe I'm wrong on all this; still trying to read around, and open to changing my mind. DFlhb (talk) 17:53, 16 September 2023 (UTC)
There's way more low-hanging fruit than you think. (My current drafts and protodrafts include a Pulitzer winner, multiple recent popular works of media, and a major active political figure in an enwiki market -- and that's just what's redlinked, not what's in bad shape.) The trouble we have is not that we're in a maintenance phase -- quite the opposite -- but that we don't have the manpower for the expansion we need. The myth of the maintenance phase makes this worse, because people who've been exposed to it in the first place are people who are already part of the project so should be expanding articles. This is one reason I keep banging the drum of no seriously Wikipedia is not paper, in that how something is covered can change on a regular basis and that's a sign of the project functioning well. Increasing notability thresholds specifically is the worst of all worlds, because it decreases rather than increases the opportunity for editorial judgement on how to cover things that may or may not have article potential. Vaticidalprophet 21:24, 16 September 2023 (UTC)

@Joe Roe: I also find the idea of rate-limiting article creation to be unfortunate: it runs counter to what the readership imagines to be the ethos of Wikipedia. But I can also see how mass creation of stubs, if not well-planned, can have the same effects as mass vandalism, namely, a loss of trust on the part of the reader and a loss of morale among editors. On some other wikis, Lsjbot is the logical conclusion to this form of vandalism.

What I struggle with when evaluating this proposal is that we're treating the subject's notability as a proxy for the article's quality as a proxy for the editor's behavior. How am I supposed to explain to a prospective Wikipedian that they should not write a stub about a real place until they've called around about offline sources to prove that an article "can be developed using known sources", and that this hurdle exists because some people once misused databases to run rampant over the site? This is why we can't have nice things?

To me, the turmoil over databases like GNIS (the U.S. federal government's official gazetteer) actually shows that an individual basic error can persist for a long time in a prominent, somewhat mission-critical reference work without causing the sky to fall. We want Wikipedia to be better than that, but not at the cost of cutting off the motivation for people to transform themselves from readers to editors.

My other big hobby is contributing to OpenStreetMap, which has somehow managed to institute an automated edits code of conduct that requires getting buy-in from the community. This policy is the lodestar of mappers who identify as "craft mappers" and view automated edits with varying levels of suspicion, while the people who fill in the map at scale often chafe at the red tape. The policy is as robust as Swiss cheese, but it does send a message to people in a hurry that community health comes first.

Nowhere in this calculus is the idea that a contributor cannot micromap some kind of map feature in detail just because someone once botched a bulk import of it (which does happen, to be sure). Contributor behavior doesn't fundamentally change the nature of the real world that we document. A wiki map is very different than a wiki encyclopedia, but regardless a wiki is truly defined by what it can be, more than what it cannot be.

 – Minh Nguyễn 💬 00:55, 17 September 2023 (UTC)

(I moved a couple comments here up to the survey section, as they appeared to be intended as votes rather than comments. If I am mistaken, please feel free to move them back.) – Minh Nguyễn 💬 17:35, 20 September 2023 (UTC)

You'll find that you can write a fleshier stub on most legally recognized populated places. Even if it's map observation. The problem is the sub stubs where all there is is "xxx is a village" and you do a google search and no population data for it, nothing but databases. In those cases, I absolutely support the proposal to encourage merger into tabled lists. What about an Option 3. Legally-recognized populated places are generally considered to be notable, but the creation of short stubs where there is nothing but a database mentioning location online is strongly discouraged. The minimum requirement expected is a population figure and basic location details as a precondition for a stub (like Aung Myay), where the stub can be fleshed out to resemble an encyclopedia article instead of an xxx is a village type database substub. In cases where there is no population data and it resembles a database, consider merging into a tabled list.♦ Dr. Blofeld 12:02, 22 September 2023 (UTC)

A tale of two countries

Reading the votes so far, I observe that there's a commonly held fear that the proposed changes would insert extra scrutiny of cited sources into the process of assessing notability. Whereas currently, there is of course an expectation of reliable sources, but that's orthogonal to the subject's inherent notability to the extent that we can ascertain it. I'm also surprised that relatively few concrete examples have been explicitly named.

On the bright side, the United States will probably be OK, though many civil townships, census-designated places, and unincorporated places will need to be expanded, probably using information from (non-census) databases, to rescue them from deletion. Since the guideline will no longer explicitly discourage GNIS, it will be quite handy for this purpose. I bet most who like to conjure up the GNIS boogeyman don't know that the database very often includes historical details cited to reliable sources, if you can resolve the source codes. For example, our entry on Rudd, Iowa, omits the most interesting thing about it, the story of how it got its name. [15] Meanwhile, our encyclopedia is completely oblivious to Howard, Idaho [16], and Green Castle, Johnson County, Iowa. [17] But under the existing guideline, these places would have a decent shot at GNG anyways; the sources listed in GNIS are just the tip of the iceberg.

Not every country is so lucky. Turning to the other country that I write about, Vietnam, I certainly wonder if readily available online sources will be adequate for a place like Tân Hà commune (equivalent in stature to a township in the U.S. or a parish in England). The article already has a good start; it just needs a Vietnamese Wikipedian to translate to English. One need not worry that GeoNames has hallucinated this place. Otherwise, how would the infobox include a photo of the Tân Hà Commune People's Committee headquarters (equivalent to a town hall)? But it contains very little text "beyond statistics, region, and coordinates" and relies almost entirely on a government gazetteer and official website. The one addition is its administrative history, citing the relevant government edict (a primary source). Digging around online for a hot second, all I could find are content mills punctuated by the occasional news article that, if included, would surely provoke howls of WP:RECENT and WP:TOPIC and WP:COATRACK regarding this 40-year-old commune. [18]

Since this is a region that has been settled for millennia, there's likely a wealth of information about this commune's administrative predecessors and the local human geography in general. Much of it will be offline, perhaps in rare books collections in Vietnam, France, and the Vatican. Some of it will be written in chữ Nôm and thus difficult to search for. All of this information can be added in due time. Unfortunately, it will be even more difficult to find sources that technically pass muster as independent secondary sources. This is a country that, by law, lacks an independent press or publishing industry, so the kinds of sources we often take for granted may not exist, through no fault of the community in question. Honestly, under a strict reading of our guidelines, the best chance this place has is that a tourist from the West decided to write about it (most likely misspelling its name).

Other than the photo, this is quite typical of a Vietnamese commune article, at least to start out with. The Vietnamese Wikipedia originally banned articles on communes and everything below them, in order to encourage editors to write about districts and provinces first. But in 2007, the community decided that the ban was counterproductive and voted to allow communes under a guideline very similar to the existing one here. I'd imagine that Vietnam is not nearly the only country that would experience difficulty meeting the standard proposed here, even as the particulars may differ. I'm not entirely sure how I'll vote. Either way, I think the guideline will still be an unfinished one, subject to the usual inconsistent interpretations.

 – Minh Nguyễn 💬 00:35, 20 September 2023 (UTC)

If y'all could decide soon, I getting ready to start on 1.6 million new permastubs on villages in China and India which qualify under the current N:Geo that don't have an article yet. After that I'll move on to the electoral discticts. :-) Sincerely, North8000 (talk) 17:51, 20 September 2023 (UTC)

Sounds like we need to agree on a rate limit on permastub creation by @North8000. ;^) Minh Nguyễn 💬 17:59, 20 September 2023 (UTC)
And spoil my fun? :-) At two per minute going 24/7 it will still take 1 1/2 years to make them but they would live forever because it would millions of person-hours at AFD to get rid of them. :-) North8000 (talk) 21:14, 20 September 2023 (UTC)
Ah, but we could set up a bot to delete any articles created by you. Sure, there might be some collateral damage (like this B-class article with ~300 pageviews this month) but it be worth it. Probably. Edward-Woodrowtalk 21:18, 20 September 2023 (UTC)
Not so fast - I picked 12 at random and was able to expand all of them to Start Class by adding postal code, area code, distance to the nearest airport and a brief history of the province in which they're located! –dlthewave 22:03, 20 September 2023 (UTC)
If there's anyone notable from the village, one can just copy I mean, "summarize" their article, probably doubling the length! You're right, there is potential for growth. We could also transliterate the village name into various languages. Edward-Woodrowtalk 22:07, 20 September 2023 (UTC)
My dog chewed that patch up after I took the picture for Wiki, so Wiki got me to create my only memory. I was also riding high when SS Edmund Fitzgerald was the article of the day but a making zillion permastubs is is appealing enough to make me give up collecting string. :-) North8000 (talk) 11:56, 21 September 2023 (UTC)
Hey, at least WP:N8KSTUBS will give China and India something to agree on. Levivich (talk) 22:39, 20 September 2023 (UTC)

The present standard means we are literally publishing articles about non-notable houses

Not just that, but articles explicitly pointing out that the people who live in them live alone, and a long way from any neighbouring settlement. See Novino for one example chosen from many of single-occupant "villages" in Russia that were mass-created based on Russian census data. But it's a "legally recognised populated place" since it's a counting-unit in the Russian census, right? Does this strike anyone else as blatantly wrong, ikky, and potentially dangerous? FOARP (talk) 10:30, 27 September 2023 (UTC)

Does this strike anyone else as blatantly wrong, ikky, and potentially dangerous? Not really, especially if we approach this without the appeal to emotion. The content would probably be better merged somewhere (but I don't immediately know where) but that's not really your point. More importantly, your issue seems to be (again) with the mass creation of articles you don't like. Thryduulf (talk) 10:44, 27 September 2023 (UTC)
It's not just an appeal to emotion - the danger in advertising the location of random single-occupant houses and saying how far they are from potential help is obvious. FOARP (talk) 11:17, 27 September 2023 (UTC)
the danger in advertising the location of random single-occupant houses and saying how far they are from potential help is nothing other than an appeal to emotion. Per Joe below it is also factually incorrect. Thryduulf (talk) 11:35, 27 September 2023 (UTC)
What part of the description is wrong except that it apparently now is unpopulated? We have many other such articles showing a single-occupant miles from anywhere. FOARP (talk) 12:15, 27 September 2023 (UTC)
FOARP, while I've occasionally been frustrated by the way you've gone about trying to change policy in this area, I generally think you've got a really sharp analysis of notability and WP:NOT, which has changed my mind more than once. So please don't take it the wrong way when I say you've gone absolutely off the fucking deep end trying to rationalise this. Are you seriously suggesting that there are nefarious people trawling the English Wikipedia looking for isolated houses in Central Russia? Or that this something that someone who's chosen to live alone in the middle of the forest is going to be worried about? You don't like stubs about obscure places. That's okay – others disagree, but it's okay. You really don't need to tie yourself in knots trying to come up with justifications for it. – Joe (talk) 12:09, 27 September 2023 (UTC)
"Are you seriously suggesting that there are nefarious people trawling the English Wikipedia looking for isolated houses in Central Russia?" - would you be OK with Wikipedia having such an article about your house? Or would precisely these concerns come to your mind? FOARP (talk) 12:17, 27 September 2023 (UTC)
It wouldn't make a blind bit of difference, because people can already find my house by looking at a map. Or walking down the street. Or asking the postman. I'm utterly baffled at why you would think that the location of human settlements is a secret. – Joe (talk) 12:48, 27 September 2023 (UTC)
Being a secret or not does not mean we should publish articles publicising conveniently for anyone who might be interested (which, being the internet, means everyone, including criminals) the locations where people live alone in isolated locations. Why are we publishing articles that are about single houses, simply because they are mentioned on a census - something never intended to be used as a list of "villages" - as a line-item? FOARP (talk) 12:57, 27 September 2023 (UTC)
These houses are isolated because they are farms which need to be surrounded by agricultural land rather than other houses. Most of the articles don't even have co-ordinates. There is a very low risk of anyone abusing Wikipedia to research the best place to steal a turnip. Certes (talk) 13:04, 27 September 2023 (UTC)
Which of course leads us to many of these places actually being businesses/organisations... FOARP (talk) 13:07, 27 September 2023 (UTC)
The addresses of businesses are even more readily discoverable than the addresses of individuals, so your problem that isn't a problem is actually even less of a problem. Thryduulf (talk) 13:33, 27 September 2023 (UTC)
That's a problem for a separate reason; NCORP, not NGEO, applies. BilledMammal (talk) 13:44, 27 September 2023 (UTC)
To expand on what BM just said: the issue here is that business/organisations can also be the reference point used by the census. So, if a farm, factory, school or whatever is used as a reference point, hey presto! We have an article about a totally non-notable organisation without the need for any secondary sourcing at all. And yes, this does happen with Russian census data - not just farms, but factories (like "Metallist" in the same district as Novino) and schools (like "Shkolny" - literally "school" - again in the same district as Novino and apparently just the school for the local village). FOARP (talk) 14:26, 27 September 2023 (UTC)
According to ru:Новино (Кольчугинский район), it's actually no longer populated. However, in the past it was larger, peaking at a population of 103 in 1905, and there are historical records of it going back to at least 1637. In other words, it has a verifiable history longer than most US cities. With access to offline Russian sources like local newspapers, I'm sure you could write a quite lengthy article about Novino. We can at least say that it's definitely a (former) village, not a house. – Joe (talk) 11:02, 27 September 2023 (UTC)
See, this is the problem with machine-translating a term like "деревня" and saying "ah, that's a village". In reality, it can also be translated as "hamlet" and only ever referred to the smallest variety of rural settlements. The historical records referred to are to a single person's property - a single line-item. For "local newspapers" I would be very surprised actually if there is any coverage at all of Novino per se - but if there were that should be produced in the article for it to be kept. Single-line-item listings in census surveys do not create anything that can actually be written about. The reference to US cities is random - perhaps you could explain why we should discuss US cities here? FOARP (talk) 11:28, 27 September 2023 (UTC)
I can read Russian well enough without machine translations, thanks. Whether you call it a village or a hamlet, it's a human settlement which, as our current policies wisely recognise, tend to have deep and well-documented histories even if they're (currently) small or abandoned, and even if those sources are hard for English Wikipedians to immediately lay their hands on. I assume (or hope!) it's uncontroversial that we should have an article on major US cities, hence my observation that even tiny villages elsewhere in the world can have a more enduring notability. – Joe (talk) 11:56, 27 September 2023 (UTC)
I can understand your concern about systematic bias, but the fact is that we still have to actually have at least some of that "deep and well-documented history" to write an article. We cannot simply make stuff up based on nothing more than a number in raw census data and use that to support an article whose existence implicitly suggests the existence of an entirely speculative history - that is textbook WP:OR. It is equally possible that eg. the numbers you are pointing to are an error in data input (not uncommon when dealing with raw data, and a major reason why we should not be using it as the sole source for entire articles.) --Aquillion (talk) 09:43, 28 September 2023 (UTC)
One follow-up statement here: Russia is every bit as much a settler-colonist state as the United States. New York (founded 1624) is centuries older than some major Russian cities, including Vladivostok (1860), Khabarovsk (1858), Magnitogorsk (1724) etc. Even St. Petersburg was founded in 1703.
The concept that individual farms in Russia should have articles about them because of some (undemonstrated) village-history, and are all or even mostly ancient settlements, just because Russia is located in Eurasia and not the Americas, doesn’t appear in evidence. FOARP (talk) 23:56, 29 September 2023 (UTC)
And if the history you speculate about based on census data actually does turn out to be true, and someone actually the sources you speculate might exist, we could, one day, have an actual article using them, based on actual WP:RSes sufficient to pass the WP:GNG. But without that we don't actually have enough to write an article; the fact that you had to resort to flat speculation and (bluntly) WP:OR based on census numbers shows why we shouldn't use that sort of raw data as the sole source for an article in the first place. It is possible that your speculation is correct and that this is an actual village with an actual history; it is also possible that the census figures represent something else entirely or are simply inaccurate, as such figures often are. Without better sources we cannot know for sure and therefore we should not write an article that plainly (as you implicitly acknowledge) implies to the reader things that the sources we have available do not actually support. --Aquillion (talk) 09:43, 28 September 2023 (UTC)
I'm an inclusionist but we must have limits. My house doesn't have an article, because it's not notable, and nor are those houses. I came across this problem when listing disambiguation pages that might need to be created. One example (typical of many dozens) is Davydkovo, a name shared by 1 2 3 4 5 other places, none of which have a population above ten. It is difficult to see how these articles benefit Wikipedia. They are non-notable topics, which get a free pass to article status because they are places and we have an axiom that every place is notable. It's time to challenge that axiom. Certes (talk) 12:07, 27 September 2023 (UTC)
@Certes - Agree. On the specific point of mass-creation of articles about identically-named places, often the issue is that the census has divided an existing settlement into more than one listing. For example in the same district as Novino there is Litvinovo and Litvinovo (settlement), Vladimir Oblast - which is the real settlement here? Probably neither as a glance at the map appears to show them as neighbourhoods of Kolchugino.
But they appear in the census so... presumed notable? But that's how our present standard works. FOARP (talk) 12:38, 27 September 2023 (UTC)
Yes, the creator seems to be following our current guidelines, which is why I've been tidying up the same-name confusion rather than proposing for deletion. In case it's of interest, I'm running User:Certes/Reports/Qualified titles lacking disambiguation one initial at a time. It's currently on L, which is atypical due to "List of..." false positives, but the intro links to A–K results. After A–C, I realised that most of the work was on Russian places too tiny to be the most important pages I could improve on Wikipedia, so I started skipping them. (Now Turkey dominates, but those places have populations in three figures rather than one, and there are often surnames etc. to include in the dabs.) Certes (talk) 12:55, 27 September 2023 (UTC)
Ah yeah, the Turkish Mahalle/Neighbourhood articles. We even had an AFD that decided to up-merge the ones Lugnuts created, which was carried out, and of course the up-merging has since been reverted with GEOLAND used as the reason. FOARP (talk) 13:05, 27 September 2023 (UTC)
FWIW, I don't like the idea of going after all the articles created by Nikolai Kurbatov, who states "I am a disabled with bipolar disorder. The last years i have constant depression, which destroys the desire to live, and daily productive work is the only joy in my life." BeanieFan11 (talk) 12:13, 27 September 2023 (UTC)
Wikipedia is not therapy, however much sympathy one may have for that editor (who I have refrained from naming for exactly this reason). FOARP (talk) 12:19, 27 September 2023 (UTC)
I also noticed that comment but intended to refrain from naming names. I am concerned about the effect deletion might have on authors. It's tempting to sacrifice Wikipedia's usability to provide therapy where it's needed but that's a slippery slope, especially if two editors' needs require Wikipedia to bend in different directions. Certes (talk) 12:31, 27 September 2023 (UTC)

This reminds me of an administrative area that I once wrote about. For years, Mill Creek Township, Hamilton County, Ohio, had a population of just a mother and her son, surrounded by many dead people. If this article had been killed off for what was in its original revision, I don't think I would've ever known about the township in order to breathe new life into it.

Incidentally, as of 2020, there were nine incorporated cities, villages, etc. in the United States that had a population of zero. There may be a story behind each of them that's more notable than some incorporated places infinitely larger. For example, in reality, South Park View, Kentucky, is just a nondescript office park, but it's legally recognized by not only the Census Bureau but also the governor's office as a city. This is an objectively unusual phenomenon worth documenting.

 – Minh Nguyễn 💬 09:13, 28 September 2023 (UTC)

It's possible that a few of the Russian hamlets may once have been thriving cities. Anyone who finds a reliable source saying so is very welcome to apply for a REFUND or rewrite the article. However, in the absence of any indication to the contrary, it seems much more likely that each article represents what is, was and has always been a farm supporting one family and zero notability. Certes (talk) 09:55, 28 September 2023 (UTC)
Precisely this. We have many, many geographical stubs for which there is a prima-facie presumption that they have never been genuine settlements (e.g., ones named after the thing that they are - e.g., "XXXX's farm", "YYYY station", "School", "Paper Works") but are maintained under the presumption that, since they are or were line-items in a census, then they must all be Nineveh & Tyre, each a Troy waiting to be discovered by archaeologists. FOARP (talk) 13:00, 28 September 2023 (UTC)
How many, roughly speaking? Under the proposed changes, how many days would it take to delete them via AfD? Minh Nguyễn 💬 17:28, 28 September 2023 (UTC)
We have 3,668 stubs on places with a population of 0–9. Some of those should be retained, and many articles not in that set could be deleted, but it's a ballpark figure. Certes (talk) 18:55, 28 September 2023 (UTC)
So it'll take at least a year roughly. Is the proposed change to the notability guideline, coupled with the proposed rate limit on AfD, the most effective vehicle for addressing the BLP-ish privacy concern urgently raised at the top of this section? Minh Nguyễn 💬 19:05, 28 September 2023 (UTC)
No; notablity and privacy are unrelated. If there is a concern (I'm unconvinced) then it should be addressed urgently and separately whether the offending text is in Obscurokov or London. Certes (talk) 19:16, 28 September 2023 (UTC)
In the abadi clean up we deleted 4000+ zero-population “village articles for Iran alone. FOARP (talk) 00:02, 30 September 2023 (UTC)
I was mindful in particular of the Russian selo articles in my !vote above. We should not presume it was always a sub-hamlet, a single house, or whatever it is now; that's classic unconscious bias. The "places" that were administratively created out of railroad stations are a problem that should be dealt with after a specific discussion about such places. As in the case of the Iranian abadi, which I referred to. Yngvadottir (talk) 02:49, 3 October 2023 (UTC)
This is why GNG will always be a better standard when we're talking about a class of articles thousands strong where the best we can say is that sourcing might exist but have zero evidence that it does. For Wilton, North Carolina, an unincorporated community, the only historical info I was able to find was that it had a political meeting in the 1860s. Compare that to Moss Neck, North Carolina, a place of almost no significance now but had a nice history to uncover. As someone who has tried to dig up some information on small places like those at issue, I'm fully convinced that there isn't enough consistency in coverage for these exceptionally small places (and various artificial products of census keeping practices) to warrant making broad assumptions of notability. Joe's sweeping assertion that small "human settlement[s] ... tend to have deep and well-documented histories even if they're (currently) small or abandoned" is patently false. It's simply a matter of some do and many don't. Certes hits the nail on the head: "It's possible that a few of the Russian hamlets may once have been thriving cities. Anyone who finds a reliable source saying so is very welcome to apply for a REFUND or rewrite the article. However, in the absence of any indication to the contrary, it seems much more likely that each article represents what is, was and has always been a farm supporting one family and zero notability." -Indy beetle (talk) 04:39, 3 October 2023 (UTC)
Exactly this. Assuming that a place that is unpopulated or which does not have a substantial population was always like that unless evidence is produced to the contrary is not an example of “unconscious bias” - it is simply how logic works: the burden is on the person making the claim to prove it. Rather, assuming that every statistical item in a list was once a thriving village with its own newspaper is simply illogical when so many of them are very obviously railway stations, factories, mines, sidings, lumber-camps, barracks etc. etc. FOARP (talk) 08:04, 3 October 2023 (UTC)
The other thing is that failure to find adequate information isn't necessarily a permanent problem. If that information turns up, an article can always be written or recreated. It's more important for articles to be accurate than it is for them to exist. Mangoe (talk) 21:18, 3 October 2023 (UTC)

Notifications

The discussion above is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.

Request for comment: Unreferenced PROD

The following discussion is an archived record of a request for comment. Please do not modify it. No further edits should be made to this discussion. A summary of the conclusions reached follows.
There is overwhelming consensus against this proposal. For a perennial proposal to succeed, something must be different about this specific iteration of the proposal, and that is not the case here. Supporters generally argue that unsourced articles are a Bad Thing, and that notion did find a wide degree of support. However, the opposition argued that unsourced articles can be fixed by editing, and thus should not be fixed with deletion. If it can't be fixed by editing (i.e. there are no sources that could be added to the article), opposers argued that this process is redundant to AfD and/or PROD. Some supporters also argued that the status quo amounts to a grandfather clause: new unsourced articles tend to get draftified, but older unsourced articles just sit around. This was squarely refuted by opposers, who noted that the proposal explicitly contains a grandfather clause exempting articles already tagged with ((unreferenced)) from this process. (non-admin closure) HouseBlastertalk 17:42, 9 October 2023 (UTC)


Should the process described in this discussion be adopted in some form; to wit, should unreferenced articles be subject to a PROD-like process where, after a two weeks (or other suitable time), the article is deleted? 19:11, 6 October 2023 (UTC)

In earlier times, Wikipedia needed to expand, to cover a variety of topics, so we could beat out competitors such as Britannica and Citizendium. That time has passed. We should now put more emphasis on quality, ensuring that our articles are trustworthy, well-written, neutral, and verifiable.
Obviously, the nominator of an unsourced article under this process should take time to conduct a quick WP:BEFORE; but again, the burden to demonstrate verifiability lies with the article creator. Article reviewers and other editors shouldn't have to break their backs scratching up sources for a now-destroyed theatre in Moscow.
This process would allow editors, using an ((unreferenced prod)) tag or similar, to nominate such unreferenced content (viz., articles) for deletion. These proposals would have the regular notifications to WikiProjects and user talk pages, giving involved editors plenty of time to add at least one reference that supports article content.
This would be a separate template from the existing ((unreferenced)) tag, so articles tagged with that template would not all suddenly be nominated for deletion. However, this would allow us to chip away at the backlog of 119 000 or so unreferenced articles more quickly, as the backlog would no longer grow – new articles would be tagged with the new PROD.
Given the two weeks to add at least one reference to the article, I find it unlikely that many articles would actually be deleted under this process.
WP:TLDR? A gloss of my 340-word comment: I think this proposal is an extension of our policy of WP:BURDEN. Article creators and interested editors are given two weeks – fourteen days! – to provide at least one reference that supports the article content. This would also help reduce the large backlog of unreferenced articles. An important point to note is that a separate template would be created; articles already tagged with ((unreferenced)) would not be all suddenly up for deletion. Edward-Woodrowtalk 19:11, 6 October 2023 (UTC)
@Shells-shells, Immanuelle, Phil Bridger, Kusma, James500, Espresso Addict, Pppery, Thryduulf, Red-tailed hawk, BilledMammal, Oknazevad, and David Eppstein: If you're interested here's an example:
  • I recently came across Chaimite, Mozambique, an unreferenced article. Initial searches turned up databases and Wikipedia mirrors/forks/scrapers, or passing mentions that only proved its existence. The was created in 2010, by the way.
  • If it was recently created, I would draftify it... but it's 13 years old.
  • Under this proposal, I would tag it with ((unreferenced PROD)), notify a WikiProject or two, and move on.
  • Given the phrasing of DEL-REASON 7, though, I will continue my attempts to verify the information, probably eventually taking it to AfD or regular PROD.
Edward-Woodrowtalk 17:02, 7 October 2023 (UTC)
Or I will spend effort the page creator should have spent getting the article sourced. Edward-Woodrowtalk 17:05, 7 October 2023 (UTC)
No, it is not effort that only the page creator should have spent. As I have said before the page creator does not own the article, and this is a collaborative project, so anyone with an interest in the subject, including you, should spend the effort. Phil Bridger (talk) 18:07, 7 October 2023 (UTC)
I think this is a fine example of our current system working properly. Someone created the article as a translation from a pt article that had a source, but failed to include it. Another editor came across it later and sourced it. Encyclopedia built. Espresso Addict (talk) 22:14, 7 October 2023 (UTC)
This seems helpful. Worth at the very least a redirect to the parent Chibuto District. Deleting articles like this would make our systemic bias worse. —Kusma (talk) 17:10, 7 October 2023 (UTC)
None of that explains why standard prod, AfD, merging or redirecting are insufficient. Especially for places in parts of the world less well covered by the English-speaking internet, "initial searches" are wholly insufficient to declare an article unsalvageable. Thryduulf (talk) 17:29, 7 October 2023 (UTC)
Exactly. This would make systemic bias a lot worse on wikipedia Immanuelle ❤️💚💙 (talk to the cutest Wikipedian) 17:58, 7 October 2023 (UTC)
@Edward-Woodrow: This proposal was explicitly stated not to apply retroactively. Under this proposal, you could not tag a 13-year-old unreferenced article as you propose to tag it. This is a bad example. The proposal has not even gained consensus and already we see severe WP:CREEP in it. —David Eppstein (talk) 18:15, 7 October 2023 (UTC)
No, it would not apply to articles already tagged with ((unreferenced)), something that article was not. Edward-Woodrowtalk 18:30, 7 October 2023 (UTC)
That's not a helpful distinction to make. We can apply this to old articles, but only if nobody has bothered to tag them for cleanup? Why? Another reason to oppose this proposal. —David Eppstein (talk) 17:17, 9 October 2023 (UTC)

I swear there was another difference, but my mind is blanked out at the moment. Edward-Woodrowtalk 21:53, 6 October 2023 (UTC)

By "established editors", you mean those with autopatrolled? If an autopatrolled editor is creating articles entirely without sources, then the autopatrolled right should be removed. Espresso Addict (talk) 22:04, 7 October 2023 (UTC)
The discussion above is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.