Sunday, March 25, 2007

Today's paradigm of quality on Wikipedia

Today's punching bag article is "List of ministers of the environment". This is potentially interesting information, perhaps, but it is presented in such a mindwrenchingly bad way that it should probably just be deleted.

The main problem is the use of multiple unaligned tables. Each country's listings are in independent tables, which means the columns do not line up from section to section. This largely defeats the purpose of having columns.

In addition, significant amounts of information are being conveyed using shaded pastel colors. (This is actually how this article came to my attention: it abuses the {{yes2}} and {{no2}} templates to get the coloring. Formerly it used the {{but-yes}} and {{but-no}} templates, but that was recently changed when those templates were deleted.) This is unfriendly to color-blind readers; the red and green colors used for those boxes are visually indistinguishable to someone with protanopy.

There is an argument that concluding that some of these people qualify as "ministers of the environment" constitutes original research. At the very least, the sources that support identifying any particular position as a "minister of the environment" need to be more clearly associated with the individual claims.

Ideally, all of the information on this list would be moved into relevant articles as metadata and the list generated automatically from the metadata. This is a persistent problem with Wikipedia lists.

Citizendium plagiarizes?

It seems that Citizendium, despite its fancy claims, is not above plagiarism: compare this image on Citizendium with this image at Commons.  Note the stunning lack at Citizendium of any credit to Magnus as the source.  Not only is this copyright infringement, but it's morally dishonest for Citizendium and its contributors to take credit for work they did not do.

Shame on you, Larry.  I would have expected better of you.

The demographics of Wikipedia

Reading Geoff Burling's recent post on "Age and Wikipedia" got me thinking about age, and about other demographics, in the Wikipedia communities. However, I think the behavior Geoff is writing about is not entirely a symptom of age (although certainly many of the people expressing the behavior in question are teenaged boys, as I've commented on before), but rather of psychological characteristics that Wikipedia selects for, combined with the simple fact that there are a lot of teenagers (boys, mainly) with scads of free time to burn on the Internet.

A few days later, I then saw this, from Language Log, about a college freshman who credits Wikipedia for his passion for linguistics. And certainly Wikipedia does probably attract significant numbers of people who first discover Wikipedia while scratching some knowledge itch. I seem to recall that my first encounter with Wikipedia was due to an interest in mathematics. Of course, not nearly everyone who finds Wikipedia as a reference source goes on to be even so much as a casual editor, let alone a dedicated editor, and I doubt that anyone has anything better than a wild guess as to the conversion rates there.

And this brings me to the major annoyance in discussing demographics of Wikipedians: there are no meaningful demographics about Wikipedians. About the only subgroup of Wikipedians for which there is even a hope of meaningful demographics is that group which goes to Wikimania. The culture of anonymity there is so strong that many admins, and quite probably a majority of editors, do not reveal even basic demographic information about themselves. There are a number of voluntary surveys (e.g. the "list of Wikimedians by age" on meta), but any statistician knows that self-selected surveys are problematic at best and the response rate on these surveys is generally so low as to be useless for any meaningful purpose. Even so much as estimating the number of distinct editors on Wikimedia projects is hard, because of anonymous editing (which results in multiple people being difficult to distinguish) and sockpuppets (which results in a single person appearing to be multiple people). The English Wikipedia has millions of users, but a rather large percentage of them were created for the sole purpose of vandalism. The number of true, non-anonymous editors is simply not known, and is likely unknowable as well. And nobody can really agree on the best method to find reasonable estimates (most of which have to do with only counting editors who make more than some number of edits in some fixed length of time).

So, while it is "common knowledge" that "Wikipedia is run by high schoolers", there really is not any objective basis for this statement. At best, it's an intuitive guess extrapolated from very limited information. Certainly there are high school students involved with Wikipedia, but I think the above-linked article about the passionate linguist is proof of why this can be a good thing. Extrapolating from a few instances to the general case, however, is fallacious. I would love to see real, meaningful statistical data on the demographics of Wikipedia readers, contributors, and community members instead of the current mishmash of wild guesses, extrapolations, and outright hyperbole that is sadly passing for fact in such discussions.

Friday, March 23, 2007

On moving

I do not enjoy moving.  It is a very timeconsuming process.  It is a process that does not leave one time to write interesting things in one's blog.

Thursday, March 15, 2007

Notability, maintainability, and quality

Sage Ross reports (in his blog, at "Wikipedia and Notability") that the community is unhappy with the current definition of notability. I've touched on notability before, in the limited context of webcomics (see Webcomics and Wikipedia and On Webcomics, again). As Sage notes, "notability" has always been a contentious issue in Wikipedia, and there is indeed currently a dispute over what, if anything, "notability" should mean.

In the interest of disclosure, I will reveal that I am an eventualist inclusionist mergist. (I do not consider the latter two mutally contradictory.) My experience, in the somewhat over two years that I've been involved with Wikipedia, is that the scope of what constitutes "acceptable content" for inclusion in the encyclopedia has consistently broadened over time, although certainly in some areas (such as webcomics) there have been pushbacks. A good example of this trend must necessarily be high schools. When I first started at Wikipedia, in late 2004, very few high schools had articles, and most attempts to create one were met with a rather quick deletion on the basis of being "not notable". By 2006, it was generally accepted that high school articles were not subject to being deleted on the basis that they were "insufficiently notable", and today nobody (except for the most hardcore deletionist) contemplates deleting a high school article for very long. Similar trends have seen individual articles on every Pokemon, articles on individual episodes of various television shows, and all sorts of other content that would likely have been summarily deleted in 2004 become generally accepted as appropriate content in 2007.

This is, in my belief, largely due to the fact that the people who feel the urge to remove what they feel is meritless content are simply outnumbered by the people who would create such content. There has not, in most cases, been any conscious decision by the Wikipedia community (if in fact that entity is capable of making decisions, which I rather highly doubt) that articles on individual episodes of the Simpsons are appropriate for inclusion; rather, the articles were created by dedicated Simpsons fans, and nobody with an eye for trimming the encyclopedia got to them quickly enough to effectively resist their presence, and so they, by default, became part of the accepted corpus. I see no reason why this trend would not continue, and so I therefore expect that over time the margins of notability will continue to be pushed further and further back. I don't think that the margins will ever be pushed out completely to the point that (e.g.) the serial number of the dollar bills in my purse will merit their own articles (although it's not entirely out of the question, as many of them are catalogued already at wheresgeorge.com), but I think there's still a great deal of room for expansion and I expect to see Wikipedia expand into that space over the long haul.

The ongoing battle over webcomics seems to be the current exception to this trend, and I don't expect it to continue. Assuming that they don't give up, the webcomics fans will eventually win, as they simply outnumber the notability pruners. At the moment, the pruners are organized against webcomics, and they are assiduously defending that territory. However, the pruners are more subject to attrition in the ranks than the webcomics fans, and it is likely inevitable that too many of their faction will leave Wikipedia or be drawn off into some other battle (say, amateur sports leagues, or radio towers, or some other equally borderline area) and the resulting loss of active focus will let the webcomics fans win out. It's far easier, in most cases, to recruit people in favor of keeping content than it is to recruit those opposed to it.

So, rather than spending a lot of time refining the definition of notability, I would advise discarding it entirely. Notability is, in practice, is a proxy for a large number of largely personal beliefs about what should be in an encyclopedia for which there is no consensus within the Wikipedia community. Furthermore, those beliefs shift over time, and I believe that shift will tend toward broader inclusion over time. The problem with broad inclusionism is that it will inevitably lead to more articles than the Wikipedia community can effectively maintain. (It is difficult to deny that this has already happened.)

The problem with discarding notability is that immediately people will scream "But then we will have articles about what you had for breakfast yesterday". Well, no, we won't. (Although it might be interesting to have that data; I'm sure that there will be people in 2150 who will be interested in knowing about the dietary habits of early 21st century IT professionals. There are probably people in 2007 with that interest, for that matter.) I am not advocating having no standards at all; that would be irrational. Instead, the standards must reflect maintainability as the main consideration. A record of my breakfast yesterday (for the record, two glazed Dunkin Donuts and a bottle of Aquafina) is unverifiable, and thus unmaintainable, and thus unfit for inclusion in Wikipedia. Verifiability isn't enough for maintainability, but it's definitely a minimum characteristic.

This seems to be the general direction of the discussion that Sage refers to, although they're not characterizing it as maintainability, but instead attributability. I don't think attributability is enough. One of Wikipedia's largest problems right now is that it's larger than its community can effectively tend to. Wikipedia needs to aggressively limit its growth, at least in the short term, to give its community enough time to structure itself better to be able to handle the content it has now, to say nothing of the content it will acquire in the future. The problem that adopting attributability (or verifiability) as a minimum criterion for inclusion is that someone is going to have to check the cited sources for accuracy. Nobody is doing that now, except on a haphazard basis. Wikipedia has no process now for any sort of organized maintenance of the encyclopedia; even vandalism management is done haphazardly.

Quite frankly, I think it would be appropriate for Wikipedia to disable new page creation (except for admins, to deal with special cases) for an entire month and spend that month developing the infrastructure to better maintain both the articles it currently has and the new articles it'll gain once new page creation is reenabled. New page review needs to be systematic, not haphazard, and there need to be systems to ensure that every new page is looked at by at least one and preferably several experienced editors promptly after creation, both to properly categorize it (the stub sorters already sorta do this, but they do so in a far less useful way than they could) and to evaluate the article for what action the community needs to take with respect to it. And then the community needs to actually do those things.

There are currently 21,598 articles tagged as needing cleanup and 55,928 tagged as lacking sources. And I suspect that only represents about 20% of the articles that actually belong in those respective categories. These numbers are not falling with time; they are growing (a month ago, there were only 49,607 tagged as lacking sources). These backlogs reflect the rapidly declining overall quality of Wikipedia. The situation may already be out of control; if it is not yet, it likely will be soon. The problem is that the community largely seems not to care, and that really bothers me.

Deleting all unverified articles would be a good start. Not all at once, but a deliberate, systematic process to either source or delete those 55,928 articles would be a great start. Proper use of automation is critical to this, and I really think that's where Wikipedia needs to be concentrating its activities in the next year. It would be great if the Foundation would help to recruit the volunteers needed for this effort; the problem with the current community is that there don't seem to be enough people interested in this sort of work to get it done.

Cook County bans smoking

Cook County's public smoking ban went into effect today. This means that any place in Cook County not already subject to a community ordinance regarding smoking (basically, unincorporated Cook County and any city, village, or twon which hasn't bothered to pass a more or less restrictive ban yet) is now subject to the county's rather broad public smoking ban, which includes both bars and restaurants.

The Cook County ordinance is similar to the Chicago one, except that it does not have Chicago's "air scrubber" exception for bars: all bars are now nonsmoking. This means that Possum's Pub, a little hole in the wall bar a block from our house, might actually be appealing: it's in unincorporated Cook County, which means that they will be nonsmoking and there's no village council that can change that short of annexing the property, which will not likely happen.

Too bad the setback is only 15 feet. I'd have much preferred 25.

Wednesday, March 14, 2007

Do your bits have Colour?

Thanks to Kat to pointing out this rather interesting attempt to explain legal relations to computer geeks. I like the Paranoia reference.

Of course, he's not entirely right: Copyright law doesn't deal with things as small as "bits". Copyright law doesn't care about the details of the representation; as far as the law is concerned, files generated by Monolith aren't distinguishable at all from the original content; they're just encoded in a funny way. And encoding doesn't matter. So, really, it's not about whether your bits have the "copyright Colour" (which, according to this guy, only exists for lawyers), but really rather bits exist at all. The "I can't distinguish this file from random noise" is simply false: you can distinguish it from random noise by decoding it. If it were random noise, it would still be random noise after decoding, but it's not.

It is a truism that computer geeks routinely babble nonsense about copyright, usually because they use analysis techniques that are based on strict logicalism, instead of the goal-oriented legal reasoning methods that lawyers are taught and routinely use. In law, if a strict application of some rule leads to an absurd result, you ignore the result, and find some other rule to apply in that situation that doesn't lead to an absurd result. The law doesn't like absurd results (they're called "antinomies" in some older texts); judges tend to scoff at the idea that they should require something absurd and will generally find a way out if given the chance.

He does, however, get it basically right: "Colour" is not a characteristic of bit sequences; it's a characteristic of processes and of history. Ultimately, law isn't about things. It's about people, and more specifically the relationships between them. Nothing can be evaluated in a legal sense without knowing the context and history in which the evaluation is required. And that's a large part of why computers cannot, on their own, enforce laws: there is always the possibility of a context or a history which the designers of the "enforcement system" didn't envision. People have the flexibility to deal with that and move on; computers, not so much.

Tuesday, March 13, 2007

Is Wikimedia really committed to open source?

One of the principles that I was under the impression that Wikimedia follows is a commitment to use open source software whenever possible. As an organization committed to open content, the Foundation supposedly strives to use only open source software in its main production operations (and, exclusive of some router firmware, it does as far as I know). In addition, the Foundation is supposed to minimize its use of proprietary software for accessory uses, using open source options when available.

So, then, can someone tell me why Wikimedia has set up a Ventrilo server? There's nothing Ventrilo can do that Asterix can't. Furthermore Ventrilo only supports Windows clients, and charges a monthly licensing fee; Asterix supports any SIP client (of which there are dozens) and (being open source) has no fees at all. On top of that, Ventrilo is merely a voice chat server; Asterix is a full-blown telephony application, with voice mail, ACD, IVR, and basically everything you'd want in a regular PBX system, plus the ability to set up softphones (or, with minor hardware investment, hardphones) that work from anywhere.

I've been trying to convince the Foundation to set up Asterix for over a year now, if for no other purpose than for the occasional conference call, and to make it possible for Danny to forward calls to Florence without having to deal with international calling issues. I've apparently been ignored on this issue, however; and when the ComCom (or someone else in the Foundation) decided they needed voice conferencing that was less unreliable than Skype (which isn't saying much) instead of turning to a volunteer who has actual experience with voice-over-internet applications in the real world, they pick up a voice chat program that is marketed primarily to gamers.

I hope the Foundation hasn't spent too much on Ventrilo, or on the Windows machine that it's running on. I consider every cent spent on it wasted money, when a deployable open source solution exists and has existed for months that does the same thing and since it runs on Linux, wouldn't have cost a Windows install, either.

Really underscores Wikimedia's need for a proper CTO, doesn't it?

Sunday, March 11, 2007

Why do people edit Wikipedia?

I mentioned sourcing as the solution to the credentialing problem in my last post. And in thinking about some of what I wrote there, it occured me that Wikipedia's greatest problem has to do with "goal management" in the community. Wikipedia not only suffers from the lack of a meaningful goal statement itself (it defines itself as "a multilingual, web-based, free content encyclopedia project", but that doesn't really set measurable goals for the community), but also suffers because, in my expectations, the personal goals of many of its contributors do not align well with the overall project goal.

Really, why do people edit Wikipedia? And I don't mean "what draws people to Wikipedia". I want to know why people stay at Wikipedia. What addicts people to this site? And how does the personal pursuance of whatever personal wants these people are satisfying through their participation help make Wikipedia more of a "free content encyclopedia"?

I suspect that one of the major motivators is the desire to show off one's own knowledge. Quite a few people (including a number of the so-called "active contributors" who are currently claiming the right to VestedContributor status) are exercising this motive. And that's fine -- up to a point. There are at least two problems that come out of participants who are acting out of this motive. One of them is the tendency to claim ownership rights on one's own contributions. For these people, their contribution list, and especially their list of "featured articles" or otherwise specially-recognized content, becomes a substitute for penis size, and they thereby become unreasonably possessive of the articles they craft. They may also attempt to subvert the quality recognition processes in order to ensure that their own content is more likely to be favored with designations of quality, again in order to boost their own importance.

Another is the tendency to doggedly defend what they "know to be true" even when doing so is not defensible. Sometimes it's just due to lack of sourcing -- I could write a lot about computers, or electrical systems, or any of many other topics, based on my work experience, but I couldn't source most of it because I've learned what I know by osmosis, and don't have reference materials to back it up. Quite a lot of Wikipedia's content is of this origination. If Wikipedia had a dedicated corps of fact-checkers and fact-sourcers, this wouldn't be a problem; such articles would merely be treated as unverified drafts, presented as such, and eventually cleaned up to the next level. The problem is that that doesn't happen. The lack of such a corps is why the English Wikipedia has about a 9:1 crapticle:article ratio right now. As long as most of the content on Wikipedia is in the nature of "verbal diarrhea" (which so much of it is) this won't easily change.

Another major motivator is the urge to belong. Especially in the past year or so, Wikipedia has become one of the "places to be" on the Internet, and a large number of people have been attracted to Wikipedia for reasons that have rather little to do with Wikipedia being an encyclopedia. I have, in the past, referred to this as "playing the Wikipedia MMORPG". By and large, these people are teenaged boys, and virtually all of them are vandalism patrollers, some to the exclusion of all else. They have brought with them a number of undesirable dynamics: an expectation of hierarchy, a quest to achieve rank, a tendency toward dogmatic, unthinking application of rules, a propensity toward hazing those below them in their perception of the hierarchy, and a near-complete failure to understand the real purpose of Wikipedia in anything but the most superficial of ways. (That is to say, they usually claim to understand that Wikipedia is an encyclopedia, but have no idea what that means, and they have no grasp of board policy principles, only the specific statements made in individual policies.) I strongly suspect that PDD spectrum disorders are pervasive in this element. The sad thing is that these people are eminently useful to Wikipedia: they tend to be organized and follow clear process well, many are good at analyzing detailed data, and they generally do not mind doing repetitive tasks over and over again. We should put them to work sourcing articles; instead, we expect them to manage interpersonal conflict, which by and large they are really really bad at, and to deal with vandalism and trolls, which requires more diplomacy and discretion than the average member of this group can produce.

Another motivation, not entirely unrelated to the above, is to be important, powerful, influential, or even just noticed. I think this explains the Essjay situation to some degree. It can also explain a lot of trolling (since trolls typically do what they do in order to be noticed) as well as the addiction to drama that seems to pervade certain segments of the Wikipedia community.

Other motivations include the desire to overthrow the copyright system, or even to overthrow capitalism, a genuine interest in creating free content, and altruistic desires to create something of value for mankind. These people tend to be more likely to have an appreciation for the broader principles of the project. And they tend to run into trouble because they're the minority in the community.

The main failing, in my opinion, is Wikipedia's failure to erect a structure that can be used to channel the people in the second group into useful work. I am reminded of the death of the CommuniTree (see Shirky's article). The near-total lack of rules and structure on Wikipedia at the time these teenaged boys started flowing in meant that these children could not be channeled into useful work by an existing structure; had there been one most of them would have either flowed into it willingly or else rejected it and left. Because no such structure existed, they created one. And the one they created is designed not toward the purpose of creating a quality encyclopedia meeting certain fundamental core values, but instead toward the purpose of playing the Wikipedia MMORPG. Many of the current problems in the Wikipedia community is that there is a large, active subcommunity which views itself as having vested rights and which is pursuing goals that are at least somewhat at odds with the foundation goals of the project.

Fundamentally, Wikipedia needs to contemplate what the motives of its editors are, and find ways to discourage those whose motives are unproductive or which tend to work at cross purposes to the project's purposes. I don't think much effort has been put into understanding why people edit Wikipedia; in most cases we don't know at all, and where we have any clue it's based on voluntary responses to a survey, which often fails to reveal self-serving or otherwise "undesirable" motives that respondents don't want to admit to.

The conclusion I keep coming back to is that the wide-open unrestrained editing model that Wikipedia started out with simply will not lead to long-term reliable quality. Worse, the community structures that have evolved in recent months seem designed to entrench the current miasma of low-quality articles; I fear that without major change Wikipedia has reached the endpoint of its evolution.

It's a shame that Meatball is down at the moment; there's a lot of good stuff there that is pertinent to this discussion.

Saturday, March 10, 2007

Credentialism

So, in the wake of the Essjay scandal, the whole issue of Wikipedians claiming credentials has become a topic of rather a lot of discussion.

One of Sanger's long disagreements with Wikipedia has been over Wikipedia's failure to recognize credentials. Citizendum has long had policies designed to give those having credentials special status as editors; recently, Citizendum withdrew the "honor principle" regarding credentials in Citizendum: Citizendium editors will now be required to validate their identities and credentials. Larry denies that the Essjay situation was involved, which I think is not entirely true, but anyway....

Still, the main topic of discussion has to be Jimbo's proposal regarding credentialing. Although which one to discuss: this one, where he suggests that degree-holders might fax a copy to the Foundation office? (Danny runs screaming to the hills, dreading the thousands of faxes of Damnation University diplomas that would invitably result.) Or this older one? Or this one (which appears to be the most recent Jimbo proposal), which (horrors of all horrors) authorizes the creation of a "This user has a Ph.D." userbox, linked to a subpage that lets people describe their attempts to verify the holder's claims of their credentials.

I submit that Jimbo has completely missed the boat here. Wikipedia should not care if you're credentialed. Credentialing creates a stratified, classful editing culture based on something that doesn't relate all that well to editing competency. It's not a structure that will help Wikipedians write an encyclopedia -- although it is one that will help some people win petty combats over article content. The only good that verified credentials do is provide some vague smattering of reason to believe that when someone says "This claim is bollocks" they're right -- and Wikipedia already has a better way to deal with that, that avoids all the issues associated with credentialing: sources.

Quite simply, if you're going to assert that something is true based on your credentials, you should stop and instead assert that it is true based on a source -- which, as a credentialed academic, you should be able to produce with relative ease. This works even for statements which are "generally accepted by experts in the field" because such statements are invariably made in introductory-level textbooks in that field, which most academics in that field would be familiar with, or at least be able to put their hands on without much difficulty. This may make editing "less fun" because you can't just go "Hey, I know that" and spew out three paragraghs of unsourced claims that you know are true but can't convince anyone else of, but hey, nobody ever said that writing an encyclopedia was easy. Letting people get off without proper sourcing got Wikipedia to 1,678,137 crapitcles; getting Wikipedia to half a million good articles is going to be a lot harder. (Really does remind me of writing open source software: it's easy to throw together a hunk of code that does something vaguely interesting, but both much harder and much less fun to turn that into a finished, polished product, which is why so many open source projects never make it much past version 0.2 or so.)

This solves the credentialing problem without actually requiring anybody to provide or verify credentials. With this asserted as practice, the only remaining issue is how to deal with someone who asserts something based on a claim of "expert credentials". And that's simple: demand them produce a source.

So, Jimbo, please stop talking about credentialing. It's the wrong solution to a problem that doesn't actually exist, simply because the community has already solved it. The solution to Essjay's use of falsified credentials is not to provide a mechanism for verifying such claims, but instead to remind the community that such claims don't meet with Wikipedia's pre-existing verification requirements and cannot be used to win debate. Enforce the policy Wikipedia already has; don't make bad new policy just to smooth over a PR flap.

Wednesday, March 07, 2007

Yet another social networking site

Ho, hum. Yet another social networking site has apparently started. I won't tell you the name here; if yuo want to find it, go to "Mark's Moolah" and read the article there. I took a brief glimpse and it looks like it's mainly being used to promote webcams, gaming groups, and political groups. Surprise.

I don't see how this one is any different from orkut, friendster, or any of the other social networking entities. I have yet to see a social networking application that served any redeeming purpose.

Tuesday, March 06, 2007

On Webcomics, again

It does occur to me that a good way to improve relations between webcomic authors and Wikipedia might not include trolling.

Are all webcomic authors jerks, or just most of them?

Monday, March 05, 2007

Another SEO dude who doesn't get it

Someone on IRC pointed this article out to me. Apparently, this guy is arguing that Google has some sort of stranglehold on Wikipedia, because Wikipedia is the #1 external domain that Google users go to. He seems to believe that "any business that is overly reliant on a single supplier (who cannot be easily replaced) is in a very, very precarious position".

Like so many SEO/blogspace people, he doesn't understand Wikipedia. This is evident, simply from the fact that he calls Wikipedia a "business". Wikipedia, unlike probably every site this guy has ever contemplated, does not make money on every page load. Wikipedia, as strange as it may seem, does absolutely nothing to monetize its content. Every referral to Wikipedia from Google (or anywhere else) costs Wikipedia money, in the form of IP transit as well as server load. If Google wanted to ruin Wikipedia, it would shunt even more traffic toward Wikipedia, not cut it off. (Well, cutting it off might reduce donations, too, but the connection between page views and donations is tenuous at best.)

So, Mr. Hearne, may I suggest that the next time you blather on about the Internet, you actually research the sites you talk about? We'd all appreciate it.

The New York Times covers the Essjay story

So much for the hopes that it would get lost in the weekend news. Nope. They ran it in Monday's Technology section, where it will be read widely. All in all, it is a relatively softball story, at least.

It will be interesting to see how the rest of the major media react to this. I'm not making any bets.

Sunday, March 04, 2007

On Bureaucrats

In the aftermath of Essjay's resignation, a number of people have declared that "there is a need for bureaucrats", and three people -- RyanGerbil10, W.marsh, Durin -- have already jumped into the breach to close the purported "gaping hole" created by Essjay's departure. The first two have since withdrawn their candidacies, but Durin's continues. This situation raises several related issues.

First, is the thesis that Essjay's departure creates an immediate need for more bureaucrats. This claim is unsupportable. First, Essjay was never very consistently active; he would often disappear for days or weeks at a time. Second, there is nothing bureaucrats do that is time-critical. The main responsibility of bureaucrats is to determine consensus in requests for adminship. This is not in the least bit time-critical: if some admin has to wait 24 hours, or even a week, to be promoted, the encyclopedia will not be materially harmed. Frankly, any admin candidate who is so impatient to become an admin that he cannot wait a few days more for a bureaucrat to review his candidacy and come to a decision does not deserve adminship. The other two duties, user name changes and bot flagging, are equally non-time-critical. There is simply no urgency for the completion of bureaucrat duties, and no reason to believe that Essjay's absence from the ranks of the bureaucracy will materially affect the responsible and timely completion of these duties. Not enough time has passed to see if the other bureaucrats will rise to the occasion; and in any case the expectations of RFA candidates of instant promotion are quite unreasonable, and arguably even contrary to the best interests of the encyclopedia.

Second, is Durin's qualifications as a bureaucrat, and more generally what I, at least, consider appropriate in a bureaucrat. I opposed Durin's candidacy on the grounds that Durin is an "inflexible process wonk". And I really do believe that. It's well-known that I have a history of a rather nasty past conflict with Durin. And I freely admit that that history influences my vote, but what influences me more is all the other people who have also had conflicts with Durin. Durin is rigidly inflexible on what he has decided is right -- whether or not his opinions are justified by meaningful facts or sound logic. Once his mind is made up, he is impervious to other opinions. Furthermore, he is paranoid and overly sensitive. This combination of characteristics is the reason why he has been in so many acrimonious conflicts, and why he will continue to be in them for as long as he remains at Wikipedia. Durin is a pleasure to get along with as long as he agrees with you; as soon as he disagrees with you, though, the gloves go off. He shows clear evidence of "binary thinking" (perhap's he's a Bynar?). Unfortunately, this sort of rigidity in thought is exactly what Wikipedia does not need, and especially not in bureaucrats. I have complete confidence that Durin will rigidly apply an exact percentage rule of some sort (his addiction to statistics suggests that he's already analyzed every RfA in recent memory and established what, in his mind, the proper cutoff should be).

However, none of the above is my real reason for opposing Durin. I believe that bureaucrats should not seek to "determine the consensus of the Wikipedia community" by counting votes and applying them against some arbitrary yardstick. Rather, I believe that a bureaucrat should evaluate the candidate, taking into consideration all the comments made by those commenting on the request for adminship, and determine whether or not promoting this candidate is in the best interest of the encyclopedia. I realize that this is a revolutionary position on my part; it is directly contrary to the current declared practice. However, it is the practice I desire to see followed. I also firmly believe that Durin will oppose this to his dying day -- and if he ever becomes a bureaucrat, his ability to prevent that from becoming practice at Wikipedia will be that much stronger. Therefore, I oppose Durin's candidacy because I believe his elevation is not in the best long-term interests of the encyclopedia, and not because of his capability to perform the specific duties of the office.

Saturday, March 03, 2007

Essjay quits

Well, Ryan Jordan, apparently realizing that his goose is quite thoroughly cooked, has withdrawn from participation in all Wikimedia projects, including resigning all special roles and privileges he held. (He is also no longer employed by Wikia, if this is any indication.)

I won't say I'm happy, because I'm not. However, I am at least relieved that the "crisis" phase of this episode is over. There remains some considerable risk of an edit war or wheel war breaking out over his user page, user talk page, or the Request for Comments that was in progress regarding his conduct. I am also concerned about the number of people running around declaring that there's now a shortage of bureaucrats, checkusers, and oversights (many of whom are clearly preparing runs for one or more of these positions).

Rather than jockeying for the vacancies created by Essjay's departure, I think it is far more important for the community to discuss the issues raised by his conduct. Specifically, the Wikipedia community very much needs to discuss:
  • the role of experts on Wikipedia
  • how to deal with pseudonymous individuals who seek to claim expertise in a topic area
  • whether or not to allow pseudonymous individuals to hold positions of authority, and if so under what conditions
I personally think that anybody who claims any sort of expertise on Wikipedia for the purpose of influencing discussion or debate must be prepared to immediately identify themselves in such a manner as would be sufficient to allow any reasonable person to confirm their credentials.

Also, any person who, in the course of performing any special duties within Wikipedia or within Wikimedia, would have access to information which would be subject to the provisions of the Wikimedia Privacy Policy, must identify himself or herself to the Wikimedia Foundation before being given that access. The same should also apply to those who are authorized to speak for or on behalf the Wikimedia Foundation. (The Foundation should be required to treat such identity information consistent with the privacy policy, of course.) This would include, amongst other things, all checkusers, stewards, and OTRS volunteers. It would not include administrators or bureaucrats. It would also include Arbitrators unless the practice of sharing checkuser data freely with Arbitrators is discontinued.

In any case, the community needs to discuss these matters. It certainly cannot hope to sweep these issues under the rug if it hopes to get through this incident without permanent damage.

People who lie when apologizing for having lied...

Andrew Lih, in his recent blog post, exposes Ryan Jordan's worst lie of all. In his apology, Jordan, amongst other things, accused Stacy Schiff of breaching journalistic ethics by offering him compensation. I didn't notice it right away because I'm not a journalist and don't hang around with journalists. But Andrew (who is a journalist, or at least a professor of journalism) did. And he contacted Ms. Schiff, who categorically denies Jordan's allegations. This is actionable libel; by all rights Jordan's apology should be redacted to remove the libel, but he's a known liar anyway, so it's unlikely that anyone will believe it at this point.

That does it, though. At this point Jordan should be banned from all of Wikimedia, permanently. If he is going to lie in an apology for having lied, then there is no hope for him. Lying has apparently become so ingrained to him that he apparently cannot help but do it.

Sorry, Ryan. Get thee behind us all; go find help, you need it. And until you do, stay away. We've had enough of liars here.

Jimbo responds on Essjay

Jimmy Wales has made a lengthy statement regarding the Ryan Jordan (Essjay) situation. He has professed a lack of awareness of Jordan's misconduct, which given Wales is entirely to be expected. He has also called on Jordan to resign his positions of trust within the community. (No mention is made of his relationship with Wikia, but I don't care so much about that.)

This is certainly the right thing for Wales to do now, and for that I commend him. However, Wales really should have investigated whether Jordan's actions amounted to a breach of trust before appointing him to the Arbitration Committee. It was so widely known that Jordan had claimed to be a professor that surely Wales was aware of this. As Jordan's apparent supervisor at Wikia, one would presume that Wales was aware of the content of Jordan's work history and knew that he was, in fact, not a professor, and did not hold any advanced degrees. This alone should have caused Wales to at least wonder if he had misused his claim of academic credentials in some way. Surely Wales was aware that Jordan has misrepresented his credentials to the New Yorker; I cannot imagine that Wales failed to read that article.

In short, ignorance is not an excuse here, not when a reasonable person would have been suspicious. Wales, as a community manager himself, is obligated not to remain ignorant in the face of reasonable suspicion. He should have investigated; he apparently did not, and thus his actions led to this situation going from being merely embarrassing (the New Yorker thing would have broken eventually) to thoroughly damaging -- all because Wales appointed Jordan to the Arbitration Committee at a point where he (if acting prudently) would reasonably have known about Jordan's misconduct.

Lawyers have a catchphrase: "Trust, but verify". That is what Wales failed to do in this situation. His failure has harmed his own reputation, but even more so has harmed that of Wikipedia's. Hopefully, this will be a lesson for him, and for all Wikipedians.

Random funny comment on the Essjay situation

From Rcade's blog:
Bureaucracy's engrained so deep in Wikipedia that there's an official job called Bureaucrat. Twenty-three people have attained this level, a heady rush that must be similar to how I felt when my magic-user reached 22,501 experience points and became a Thaumaturgist before he was eaten by a bulette.
So captures the spirit of the Wikipedia MMORPG.

Friday, March 02, 2007

RealtyExecutives.com brags about WikiSpamming

This press release from RealtyExecutives.com is basically them bragging about their success at spamming Wikipedia.

I haven't looked to see if the article is still on Wikipedia; I just find it amusing that the press release reads like these people got their content through some sort of arduous approval process.

If only all spammers would announce their spaminations!

Why Jimmy Wales must dismiss Ryan Jordan from the ArbCom

From the comments at metafilter:
I've never edited on Wikipedia, but just earlier today I stumbled on a page with blatant errors and further questionable information (I need to research). I only noticed the errors because the article is related to the material I've worked with since the 80s. It's factual data, not subject to interpretation. Most people would read it and accept it as truth. Stealthy bad info is evil.

I'm swamped with work right now. Still, I was considering finding the hour or two it would take to research and do a thorough job of fixing that article (properly referenced to real, peer-reviewed publications). Hell, if I needed help I could even call one of the most knowledgeable people on the planet about that subject – a PhD well known in his field – who I've known and worked with for nearly twenty years.

I have no interest in becoming a regular Wiki editor and all of the ego games that seem to come with that. I'm just a person with some specific knowledge to contribute and the resources to do a good job. But after this shit I really don't feel like volunteering my time. I've got real publications to work on. If one of the PhDs I work with turned out to be a fraud, they'd never be allowed to publish through professional channels again.
If Jordan doesn't resign, and Wales doesn't force him to, this could very well spell the end of the reign of the godking; Jimmy's just about used up the last of his cred. Then again, that might not be such a bad thing. Maybe it's time to start soliciting donations for the fork.

Another pertinent comment, again from metafilter:
I have to say, finding out a senior editor was lying through his teeth doesn't bother me much. Having the stalwart leader of the organization say "eh, whatever" though -- that does not bode well.

Wales had a good idea and made it work, but like most entrepreneurs he is ill equipped to run a large and stable organization. There will come a time (perhaps there already has) when the best thing he can do for the future of Wikipedia will be to step aside. I wonder if he'll muster the grace to do it?

Thursday, March 01, 2007

More on certain fraudulent Wikipedians

There's some interesting new evidence in relation to Ryan Jordan. Specifically, this edit -- one of Essjay's very first edits -- in which he refers to "his students" and his "Ph.D." (neither of which ever existed). It's quite clear that Essjay invented this persona before he even started editing Wikipedia under the Essjay identity. I cannot, at this time, believe that he created this identity merely to protect his anonymity, as many have suggested. No, no, this was, from its onset, a direct, deliberate fraud.

I still don't understand why Mr. Jordan did this. Perhaps it was social engineer, or simply attention seeking. Whatever it was, however, it was quite thoroughly immoral and unethical of him, and he should be quite ashamed of himself. From his "apology", however, it's quite clear that he's not. One wonders if the fact that his fictitious boyfriend "Robbie" (presumably Essjay's sockpuppet) voted for Essjay on his own RfB will have any effect: normally, the penalty for this sort of vote fraud is a lengthy block and a revocation of all privileges. It's something of a surprise to me that Essjay hasn't oversighted these edits yet.

However, this really shouldn't be about Mr. Jordan. It's not. His problems are real and quite sad and problematic, but that's not the real issue. The real issue here is that Jimmy Wales, despite being aware that Mr. Jordan had put all this energy into creating such an elaborate deception and foisted it off on the Wikipedia Community for nearly two years, nonetheless doesn't "really have a problem with it" (for more context, see this randomly-select blog article). Apparently Mr. Wales doesn't give a shit if his contributors pass themselves off as things they're not, even when representing Wikipedia in the media, lie to him and to everyone else, and generally treat Wikipedia as if it were their giant personal plaything. (Yes, I know, it really is Wales' personal plaything. Maybe he just doesn't mind sharing his toys.) I can only hope that Wales was merely uninformed as to the truth depth of Jordan's deception, and, now that it's no doubt harassing his cell phone continuously, as the media are all trying to get comments from him (hey, anybody want his cell phone number? I have it somewhere! haha, just kidding), he will presumably have enough sense to realize that he really does have to hang Jordan out to dry if he wants to retain any sense of dignity and respectability for Wikipedia.

The only other downside to this is that I'm almost certainly not going to get any sort of timely action on my efforts to get Wales to put an end to publishing libel about me on Wikipedia. I poked him last night on this issue, and he hasn't replied, but of course he's way too busy to deal with the likes of me. Oh well. Nobody believes anything they read on Wikipedia anyway.

Larry Sanger proven right about Wikipedia

Larry Sanger has long argued that Wikipedia is anti-intellectual. And, sadly, he has been proven right, by Jimbo's recent fiat appointment of Essjay to the English Wikipedia's Arbitration Committee. This person, who apparently lied to the Wikipedia community -- and to the New Yorker (scroll to bottom for editor's note about Essjay's deceptions) -- for months about basically every aspect of his identity, including his academic credentials. Amongst the lies Essjay told is that he was a leading scholar in Roman Catholicism. Jimbo -- and much of the Wikipedia community, apparently -- has apparently chosen to forgive Essjay his deceits and appointed Essjay to the Arbitration Committee, where Essjay will be called upon to decide if other editors are behaving in a manner which is beneficial to the project.

Quite frankly, a man who would lie about his academic credentials, and then use those credentials to add undue weight to his own opinions in debate on Wikipedia, does not deserve to even be allowed to edit Wikipedia, let alone sit in judgment over those who do. For examples of where Essjay has done this, see this comment (about Catholic Confession), this comment (about another aspect of Catholicism), this comment (about another Christian topic), and this comment (another Catholic topic). Essjay has no more right to claim to be a scholar of Christianity (and certainly not "Wikipedia's leading scholar on Catholicism"!) than I do, and yet his totally inexpert opinion has been given undue weight because of his lies. It is a travesty that he holds a position of high authority in this community; that he does certainly has a great deal to say about how little the Wikipedia community cares about writing a factually accurate encyclopedia.

I confess that I liked Essjay when I knew him, and in general his judgment was, in my experience, reasonably good. However, I shall never be able to trust him again, not after finding out the extent to which he lied to the Wikipedia community, to others, and (most importantly, from my point of view), to me. The best interpretation of this entire situation is that Essjay created a false identity to preserve his anonymity and then got caught up in it. However, if you want a false identity to protect yourself, that's fine, but don't elaborate it with false claims intended to puff up your own authority and prominence.

Over the past few years, a number of people with included false claims on their resumes or CVs have lost academic leadership posts (for example, Eugene R. Kole, former President of Quincy University, who resigned when two of the degrees he listed in his biography were found to be fictitious). It is startling and telling that Essjay, after revealing similiar lies, is not only not censured, but in fact elevated to one of the highest positions of responsibility that Wikipedia has. Clearly Jimbo has decided to demonstrate just how much unlike academicia Wikipedia is.

Update: Apparently Slashdot is talking about this now. Also, there's an apology (if you can call it that) on Essjay's user talk page at enwiki. I especially like the "If I leave Wikipedia, the terrorists will win" at the bottom. Nice.

2nd update: This is apparently a hot topic on the blogs. See, for example, Freakonomics, Blog World, Infothought, and Rough Type, just for starters.

3rd update: Not only is this getting even more blog play (MetaFilter, Digg, XODP, and Larry Sanger; see also Technorati), but the mainstream tech media is also starting to get involved. This is going to get ugly fast.