Wikipedia:Wikipedia Signpost/Single/2011-05-16
Geographical distribution of Wikipedia edits; Sue Gardner interviewed; brief news
Geographical distribution of Wikipedia edits visualized
Erik Zachte, Data Analyst at the Wikimedia Foundation, has produced visualizations showing the geographic provenance of edits to various language versions of Wikipedia on a particular day, using bubble maps, heat maps and animations. For example the bubble map for the spatial distribution of edits for the Chinese Wikipedia on that day clearly indicates that most contributions came from Hong Kong and Taiwan, instead of mainland China. The maps for the Ukrainian and Turkish Wikipedias hint at noticeable diaspora communities outside the main country, in contrast to, for example, the Norwegian Wikipedia. The spatial distribution for all languages combined shows a clear predominance of edits from within Europe.
The data used for the visualizations included the IP addresses of logged-in editors, which are subject to the WMF privacy policy. To protect editor privacy, edit timestamps were obfuscated by adding small errors, and geocoordinates were rounded.
Sue Gardner and Hisham Mundol interviewed
Wikimedian GerardM celebrated his birthday by interviewing the Foundation's Executive Director Sue Gardner on his personal blog. Among other things, she discussed preliminary results of the 2011 editor survey conducted recently by the WMF, which showed a markedly higher satisfaction with the performance of the WMF than with that of chapters (on a scale from 1 to 10, respondents ranked their own performance at 6.24, that of volunteers overall at 7.14, the Wikimedia Foundation's at 7.32 and that of chapters at 4.33). She believed this result was due to the relative youth of chapters. (In March, Gardner had—somewhat controversially—warned of possible risks from the inexperience of many chapters' boards, and Deputy Director Erik Möller had earlier stated that "the very existence of chapters [was] increasingly being questioned", a statement that was questioned itself. Cf. Signpost coverage.) She also commented on recent rises in traffic for the Chinese Wikipedia (with a general rise in Chinese Internet users as the most likely explanation) and from mobile devices.
Earlier last week, GerardM interviewed Hisham Mundol, the WMF consultant for National Programs, India.
Briefly
- Should grant-giving organization get observer status on the Wikimedia Board?: The Foundation's Board of Trustees published the minutes for its IRC meeting on February 21 last week. Among other topics that have already been covered in the Signpost, the minutes note that the Sloan Foundation was considering to renew the $1million/year grant it awarded Wikimedia in March 2008 (Signpost coverage: "Wikimedia receives 3-year, $3,000,000 grant"), and "had indicated an interest in having a Board observer, something not covered in our current Board protocols". The Board discussed the possibilities for such observers, or opening up meetings in general, and stated that it "was generally comfortable with having invited observers from time to time, but not at all sessions". The matter was deferred, with the Board's governance committee preparing options for the next meeting. (The minutes for two more recent Board meetings in March and April are still to be published.)
- AGK on Wikipedia over time: Anthony, a long time editor on the English Wikipedia, wrote a blog post on the Wikimedia blog to reflect on the past, present and future of Wikipedia, to give his thoughts on what it means to him to be a long-term contributor to Wikipedia and what he hopes to see happen in the coming years.
- Readability of user warnings: Hungarian Wikipedian Bence Damokos examined "the readability of user warning messages" on the English Wikipedia, calculating the readability scores according to several well-known formulae (including Flesch-Kincaid) for 105 template messages. Ranked by the SMOG readability measure, Template:Uw-aeblock came out the most difficult to read, with 18.49 years of education estimated necessary for reading it. Still, Bence Damokos concluded that "the warning messages aren’t unreasonably unreadable, although the various deletion notices, especially the ones concerned with copyright are written in a way that is too difficult to understand by the average user."
- Milestones: The Italian Wikipedia reached 800,000 articles on May 11, one day after the Polish Wikipedia had crossed the same milestone.
- Meetups: Community meetups took place this month in Washington, D.C. (May 7); Bangalore (May 8); Pune (May 14); and Hyderabad (May 15).
- Ten year anniversaries: Last week marked the tenth anniversary of various Wikipedia versions, as noted on Foundation-l.
- URL shorteners: Discussions continued last week about the domains enwp.org or wp.org as official URL shorteners for Wikipedia.
- Wikipedian in residence at Picasso museum: The Museu Picasso has announced Catalan museum management student Àlex Hinojo (User:Kippelboy) as its Wikipedian in residence. Details are forthcoming. In March, Kippelboy had been announced as "Wikimedia's first official GLAM Ambassador" (Signpost coverage).
- Joan Miró foundation experiments with Wikipedia QR codes: On his blog, Kippelboy reported on a collaboration with the Fundació Joan Miró to test QR codes displaying multilingual Wikipedia links next to 15-20 artworks in an upcoming Miró exhibition (as currently done by the Derby Museum and Art Gallery in the UK). The corresponding articles are to be created in a WikiProject, which will involve a hackathon at the museum's library.
- Board calls for openness: The Wikimedia Board's resolution on openness from last month, urging the community "to promote openness and collaboration", has been translated into various languages, forming the May 2011 Update.
- Student "sucked into" Wikipedia by Public Policy Initiative: A posting on the Foundation's Community blog described how participating in a university course within the WMF's Public Policy Initiative got UC Berkeley undergraduate User:Kgorman-ucb "hooked on Wikipedia", from making just a few anonymous edits to participating intensively in areas beyond the course, namely editing articles about mushrooms and patroling new pages. He will also teach the next version of the (student-led) course himself.
- World Heritage: Sebastian Sooth, project manager at Wikimedia Germany, posted an update on the discussions about the proposal to have Wikipedia classified as UNESCO world heritage - which the chapter first suggested in March, and presented at the Wikimedia chapters meeting in the same month ("Wikipedia as a World Heritage Site! Why should Wikipedia become a World Cultural Heritage?").
- Cooperation with Nobel Museum: The Swedish Wikimedia chapter and the Nobel Museum have initiated a partnership, according to a brief announcement (in Swedish) on the museum's website. The collaboration with Swedish Wikipedians will start this summer.
Reader comments
Education minister's speech copied from Wikipedia; Jimmy Wales interviewed; brief news
Education minister's speech copied from Wikipedia
A speech given in the UK House of Commons by British further education minister John Henry Hayes was largely copied from Wikipedia, according to a detailed comparison by ePolitix.com. Hayes was responding on behalf of the government to a private member's bill to create an additional bank holiday, and numerous sentences in his speech correspond almost verbatim to parts of the article Bank holiday. ePolitix noted that the speech was given on a Friday, where attendance in parliament is usually very low, and mentioned the minister's remark that the particular section of the speech had been prepared for him to read out, but nevertheless called the incident "not exactly a glowing example for students", pointing out that Hayes "is an honorary member of the Association of Teachers and Lecturers".
Jimmy Wales profiled by USA Today and Chicago Sun-Times
Last week, Jimmy Wales was interviewed by USA Today ("Wikipedia's Jimmy Wales takes wiki work seriously"). Among other things, the paper noted his recent editing about current events, having been the Wikipedian who moved Catherine Middleton to Catherine, Duchess of Cambridge during the British royal wedding ("Yes, I am just that big a Wikipedia geek that I was waiting with my finger on the mouse button..."); however, another Wikipedian overtook Wales in updating Wikipedia with the death of Osama bin Laden.
The Chicago Sun-Times's profile of Jimmy Wales ("Wikipedia still ad-free at 10") focused on financial aspects: "Wales splits his time between London and Florida and says he earns a living by making speeches to industry groups and schools." The newspaper quoted an online marketing consultant who estimated that Wikipedia is foregoing at least "$1 billion" in advertising revenue but added "that's good. There should be some places online without ads."
Briefly
- 3% of scientists edited their Wikipedia article: A Nature poll revealed "how researchers guard, and sometimes burnish, their online image", including the fact that "Nearly 3% of respondents had edited their own Wikipedia biographies", and several of them started them themselves, or "or edited entries to include references to their own papers." Among other scientists, the Nature article cites Wikipedia administrator Darren Logan, a gene researcher at the Wellcome Trust Sanger Institute.
- Wikipedia as a force for British punctuation: A Slate article on "the rise of 'logical punctuation'. [sic]" (the practice of placing commas and periods outside of quotation marks instead of inside) called Wikipedia "by far the biggest fount" of the practice today. As opposed to many professionally edited publications in the US, Wikipedia's Manual of Style explicitly endorses logical punctuation "because it is deemed by Wikipedia consensus to be more in keeping with the principle of minimal change" (of quotes, compared to the original).
- Impact of Google Doodles: US public relations blogger Steve Rubel examined the effect of three recent Google Doodles on web traffic using the example of Wikipedia, with help from the Wikipedia page view statistics tool by User:Henrik at http://stats.grok.se/. For example, the article on English child book illustrator Roger Hargreaves received 1.9 million views when he was featured as a Doodle on May 9, compared to 3,500 article views during the entire month of April. (Also, the "Wikitrends" tool listed the article as the number one uptrend for last week.) Last year, a list of the five articles which had received the highest hourly hit rates between January and July 2010 had included one which was connected to a Google Doodle (Signpost coverage).
- Controversial PR company reworks its Wikipedia article: After PR firm Burson-Marsteller was recently exposed as having been employed by Facebook to plant negative media stories about Facebook's rival Google, a blogger noted that the Wikipedia article about Burson-Marsteller had recently been extensively reworked by an editor self-identifying as working for the company, asserting that this "violates Wikipedia's internal regulations. And stinks to high heaven." The blogger somewhat inaccurately claimed that all information about the company's former controversial clients (such as the Argentine dictatorship of the 1970s and chemical company Union Carbide after the Bhopal disaster) had been removed. The user working for Burson-Marsteller also requested help on dealing with critical edits to the article made after last week's revelations concerning Facebook.
- Foundation co-sponsors data visualization award: The WikiSym organizers have announced the "WikiViz 2011 Data Visualization Challenge", co-organized with the Wikimedia Foundation (one of the conference's Premium sponsors) "to create the most insightful visualization of open collaboration data" under a free license.
- Jimmy Wales: UK enforcement of celebrities' privacy is "human rights violation": In the recent debate in the UK about several superinjunctions (prohibiting the publication of certain statements about several celebrities in the UK and the reporting about these prohibitions themselves) and attempts to use Wikipedia to circumvent them (Signpost coverage: "Administrators removing material that violates UK legal injunctions"), Jimmy Wales spoke out on the BBC last week, calling the superinjunctions "ridiculous" (audio recording, around 0:28:00, available online until May 18, BBC online summary: "Wikipedia boss Jimmy Wales criticises injunctions"). He explained that the information was being removed from Wikipedia due to its policy on reliable sources and noted that the WMF was based in the US and subject to US law - "that doesn't mean that we don't take into consideration laws of other places, but in the end of the day, it is US law that we follow". Emphasizing he was merely formulating his personal opinion, he criticized the superinjunctions, and more generally all privacy laws which restrain the publication of "legally obtained, truthful, factual information about public figures" (with the possible exception of cases where lives are directly endangered) as "grave injustices" and "human rights violations". Asked by the interviewer "if someone wanted to publish something that is truthful about your sex life or your private life, would that be OK?", Wales replied that there was a difference between being "OK" and being legal, and that there was a lot of unsavory reporting by newspapers which should nevertheless not be subject to state intervention.
Reader comments
Back to Life: Reviving WikiProjects
This week, we took a break from our regular interview format to focus on ways Wikipedians can revitalize semi-active and inactive WikiProjects. Our "WikiProject Report" feature has looked at more than 100 WikiProjects in the past three years, but there are countless projects that have come and gone without any fanfare. You can browse the greyed-out names in the WikiProject Directory or the vast inactive WikiProject category to get an idea how many good intentions have long since been abandoned, merged, or otherwise stopped functioning. Some of these inactive and semi-active WikiProjects may be ripe for a reboot if enough editors are willing to dedicate time to rebuilding and maintaining the project.
Projects may lapse into a dormant state for a variety of reasons. Inactivity may be a natural part of a project's life, occurring when the project's goals are achieved or the circumstances surrounding the project's purpose change. For instance, WikiProjects created as part of an academic course typically cease activity when the semester ends; Harvard's Cyberlaw course in 2006 and Croatan High School's AP Biology class in 2009 are two examples. When the International Race of Champions was discontinued, the members of WikiProject IROC voted to disband. When the Signpost's WikiWorld comic strip ended, the purpose for WikiProject WikiWorld's existence ended as well. Projects that have naturally reached the end of their life are marked as defunct and a revival is unlikely.
However, there are many projects that die long before their prime. Some of these projects never built a substantial membership because their focus was too narrow or too broad, the project's goals were unclear, or the project never received adequate publicity. Submitting a proposal to the WikiProject Council before building a WikiProject can help attract Wikipedians who are interested in the topic, clarify the project's purpose, and help prevent false starts. Networking with Wikipedians that share similar interests can help build a project's membership, and announcing the project's news in the Signpost's WikiProject Report can reach larger audiences.
For projects with too narrow a focus, it may be better to merge the project into a larger, more active WikiProject. This is how many task forces are born; for example, the Desserts Task Force was previously WikiProject Ice Cream. But some narrowly focused or poorly defined projects may have so little content that they don't warrant a revival. The old project may end up deleted, although editors can still create a new project that serves a similar purpose, building on the lessons learned from the previous failed attempt.
At the other extreme are projects with a focus that is too broad. Some broad projects simply become shells for grouping together smaller child projects while others end up reinventing themselves in order to stay relevant. For example, the first WikiProject ever created, WikiProject Sports in September 2001, has struggled to remain active as specialized projects for nearly every sport have moved editor attention to their own talk pages, assessments, and portals. In contrast, the recently revived WikiProject United States has been reorganized and absorbed some responsibilities previously held by other projects. Discussions continue about the new scope of WikiProject United States and its relationship to child projects.
Bots can save time and effort
|
Even with proper planning, many projects simply run out of steam. As the initial enthusiasm of the project's members wanes, the project's initiatives may become neglected and fall into disrepair. Productive members may leave projects after heated disagreements about specific articles or hurt feelings after mediation and arbitration. Busy editors may become so engaged in other activities both on and off Wikipedia that they stop visiting WikiProjects in which they were previously actively involved. Regardless of whether these projects are tagged as inactive, semi-active, or not at all, an effort to revive the project may be worthwhile.
The WikiProject Guide provides helpful advice for reviving a WikiProject. A good place to start is updating the list of active members to get a better picture of where the project currently stands. Some members may still be active on Wikipedia and may even continue to improve the project's articles independent of the WikiProject. Others have moved on to other things or have left Wikipedia entirely. Divide the member list into active, inactive, and former members with each member determining their own status. Check each user's contributions to see if they have been active on Wikipedia in the past few months; members who have not edited Wikipedia in several months can be classified as inactive until they return. To bring old members back and involve them in the project's revitalization, post a message on the talk pages of editors asking them to update their status and include ways they can contribute. New members can be invited personally or through public postings at the Community Portal or the WikiProject Report's news sidebar. Simply adding names to the participants list won't spur activity. In each invitation message, ask that members add the project's talk page to their watchlist and suggest ways they can aid the project's initiatives.
Improving the aesthetics of the project's page may be appropriate, although attractive projects can lapse into inactivity as quickly as projects with only black-and-white text. A new coat of paint won't fix the underlying problems of many floundering projects, but improved organization can help new and returning members find information more easily. There are a variety of generic WikiProject templates that may be helpful. Templates like {{Infobox WikiProject}} can provide a starting point for new members as they discover the project's resources. Unnecessary clutter can be removed from the project's main page, although this information should be archived in case it becomes important again. Clear suggestions as to how members can help should be easy to find, including to-do lists, {{tasks}}, cleanup listings, and watchlists. The {{WikiProject help}} template can also direct attention to the project's pressing needs.
During the project's inactive or semi-active period, portions of the project may have fallen into disrepair, templates may have been deleted, or services provided to the project may have changed or discontinued. Recreate any missing userboxes, project banners, or user invitation templates. Check the assessment system and the project's banner to ensure it still works. Parameters for automated functions like article alerts and hot articles may need to be updated. If the project has a portal, check that the fields are populated with content and make sure additional content is ready and waiting for when the portal is refreshed.
Reach out to projects that are already active and form collaborations where possible, particularly on articles where projects overlap. Competitions and drives like those held on alternating months by the Guild of Copy Editors and WikiProject Wikify can build relationships and mobilize the members of another project to achieve your project's goals.
In areas where a few small projects struggle to remain active, a single centralized project may be better. Mergers can concentrate the attention of members and allow them to more easily pool their resources, although the actual merging of projects can become contentious. Make sure all parties have been notified and have a chance to discuss a merger before it takes place.
Above all, the editors who are spearheading the revival should be open to the ideas of other members. Just like Wikipedia articles, WikiProjects are not owned by any editor no matter how much work that editor puts into rebuilding the project. Additionally, the Wikipedians involved in reviving a project should be dedicated to the project's goals before and after the revival. This means remaining active in the project, even after the project's initial rebirth. Simply creating a WikiProject and leaving it to take care of itself does not guarantee the project's survival, nor does simply revamping a project. Participate in the project's discussions and work on improving the project's articles. Building a thriving community takes time, but projects like WikiProject Oregon and WikiProject Military History show that it's possible.
Next week, we'll burn some rubber. Until then, make a pit stop in the archive.
Reader comments
Best of the week
New administrators
Two editors were reconfirmed as administrators:
At the time of publication there are three live RfAs: DeltaQuad 2 due to finish Thursday UTC, Sadads, due to finish Friday, and Logan, due to finish Friday.
Featured pictures
- Atlantic mackerel (nom; related article) (Created by User:Lviatour.) picture at bottom
- Sea foam (nom; related article), a type of foam created by the agitation of seawater, particularly when it contains higher concentrations of dissolved organic matter (including proteins, lignins, and lipids) derived from sources such as the offshore breakdown of algal blooms. (Created by User:Mbz1.)
- Agnes Milowka, underwater cave explorer (nom; related article), a notable underwater cave technical diver who contributed to the movie blockbuster Sanctum and who recently died. (Created by James Axford.) picture at right
- The Anti-Slavery Society Convention, 1840 (nom; related article), a historic painting by Benjamin Robert Haydon (died 1846), given to the National Portrait Gallery, London in 1880 by the British and Foreign Anti-Slavery Society. Oil on canvas, 1841. 2972 × 3836 mm (117 × 151 in). picture at top
- Black Rhinoceros (nom; related article), photographed in the Ngorongoro Crater by User:Ikiwaner, who said, "While some may want the feet to be shown, it's important to remember that no one mows the Savannah".
- Vegetable Lamb of Tartary (nom; related article), a diagram of the legendary zoophyte believed to grow sheep as fruit. It was historically used as an explanation for the origin of cotton. (Created by User:Mgiganteus1.)
Featured articles
There were no new featured articles. Anthony Michael Hall (nom) was delisted because of referencing problems.
Featured lists
- List of U.S. Women's Open (golf) champions (nom) (Nominated by SaysWhoWhatWhenWhereWhyHow?.)
- General Secretary of the Communist Party of the Soviet Union (nom) (Nominated by Trust Is All You Need.)
- Miley Cyrus discography (nom) (Nominated by Ipodnano05.)
- List of Scotland national football team hat-tricks (nom) (Nominated by Miyagawa.)
- Latin Grammy Award for Best Male Pop Vocal Album (nom) (Nominated by Jaespinoza.)
- Grammy Award for Best Jazz Fusion Performance (nom) (Nominated by Another Believer.)
Featured topic
Atlantic Coast Conference football championship games (nom), with four featured articles and four good articles. The ACC Championship Game is an American college football game held on the first Saturday in December by the Atlantic Coast Conference (ACC) each year to determine its football champion. The game pits the champion of the Coastal Division against the champion of the Atlantic Division following the conclusion of the regular season (nominated by JKBrooks85 and praised by the reviewers as fine work).
Featured sounds
Newly promoted sounds will be included in next week's edition.
Reader comments
Motions – hyphens and dashes dispute
The Arbitration Committee opened no new cases. Two cases are currently open.
Open cases
Racepacket (Week 3)
During the week, 40 kilobytes was submitted in on-wiki evidence before the evidence phase came to a close. Drafter PhilKnight submitted a complete proposed decision in the workshop, where comments were submitted by arbitrators, parties, and others.
Tree shaping (Week 3)
The evidence phase came to a close during the week. Further comments were submitted in the workshop.
Motions
A request for arbitration concerning a hyphens and dashes dispute was filed last week. In lieu of opening a case at this time, the Committee passed the following motions:
- 1RR, civility probation, and moratorium
- All discussions on the subject of en dashes in article titles discussion are subject to civility and 1RR restrictions, broadly construed.
- Administrators are urged to be proactive in monitoring and assertive in keeping debate civil. Actions requiring clarification can be raised with the Committee on the appropriate subpage.
- A moratorium has also been imposed on article title changes concerning hyphen/endash exchange; the only edits allowed will be to create a redirect to the existing article title until the resolution of the debate below.
- Discussion for obtaining consensus
- With the goal of obtaining consensus, interested parties have been instructed to spend their time determining the structure of a discussion on en dashes in article titles. This may be the continuation of a current discussion or commencement anew. After May 2011, six weeks will be provided for gathering consensus on the issue. The discussion should be of sufficient structure to allow easy quantification of consensus rather than a large amount of poorly-framed debate.
- Consequence of insufficient resolution
- If a determination isn't realised by mid-July, the Committee warns that a case will be opened and conduct violations will be dealt with severely.
Reader comments
Berlin Hackathon; April Engineering Report; brief news
Hackathon unites MediaWiki developers in Berlin
As previewed in last week's Signpost, the now annual Berlin Hackathon was held this week in the German capital, hosted by Wikimedia Deutschland and attended by 100 participants (among them 32 Foundation employees). Predictably, the event received a large amount of digital coverage, including several postings on the Wikimedia Blog and a live video stream from which a large number of recordings were taken (album). It was accompanied by meetings of the Wiki Loves Monuments team, whose preparations for an Europe-wiki photo contest include various software tasks, and of the Language committee.
Many of the topics featured in Friday's "lightning talks" will be familiar to regular readers of the Technology Report: development of a new parser that would make WYSIWYG editing so much easier, for example, and the new Virginia data centre (which will be online from next week). They did, however, cover some new ground, including operational details. Mark Bergsma noted that the two data centres are unlikely to both channel traffic at the same time: instead, one will serve 90% of the traffic while the other will play a secondary, backup role, though the arrangement will be flipped at monthly intervals. WMF developer Neil Kandalgaonkar talked about a recent proof-of-concept hack combining MediaWiki with Etherpad-style realtime collaborative editing in form of the "Hackpad" software (also described on Wikitech-l and in a blog post). There were also talks on the "Kiwix" offline reader (cf. previous Signpost coverage), Firefogg 2, and the Narayam extension among others. In addition to these familiar themes, the line-up also included unfamiliar names, such as PhotoCommons (a new WordPress plugin) and qunit, a JavaScript testing suite that could open up interface testing to the crowd.
Saturday proved just as informative: Trevor Parscal reported the results of trialling left-aligned section edit links (116% more clicks and 9% more edits resulting); a tentative "end of July" release date was set for MediaWiki 1.18, which will bring gender intelligence and the ability to rotate photographs to MediaWiki.
Brion Vibber, who is leading the Foundation's efforts to rewrite the parser, laid down some of his ground rules (including "no new syntax" and a gap between the release of the new parser and its use on the majority of existing pages,), see also mw:Future. According to a tentative timeline, Wikimania in August will see working demos and in November, some beta code being will go live on Wikimedia sites in the form of opt-in gadgets.
There were, however, also heterodox opinions presented. Hannes Dohrn, speaking on behalf of the Sweble Wikitext Parser team, strongly advocated a move away from the familiar "apostrophe syntax" for bold and italic, along with further changes to the wikitext syntax. In a blog post on the same day, the Sweble team made an "offer to the Wikipedia community": "Come up with a new and better Wikitext and use the Sweble Wikitext parser to convert old Wikipedia content to that new format. ... We have spent more than one year full-time working on a parser that can handle the complexities of current Wikitext and it does not make sense to us to create another one."
With one of the WMF's two top product development projects (the parser) discussed on Saturday, on Sunday WMF User Interface Designer Brandon Harris gave a talk on the second, editor retention and improving the "-1 to 100 edit experience". More specifically, he outlined the thinking behind upcoming changes to make the "identity" of contributors more visible, e.g. displaying affiliations, interests and roles, which is hoped to strengthen communities by connecting members with tasks and collaborators corresponding to their interests. He acknowledged that "the big fear is that we are going to turn Wikipedia into Facebook", and promised that this wouldn't happen, but argued that there were things to learn from Facebook in successful community building. Harris predicted that some of the upcoming changes will be controversial, but concluded the talk by advising "it's time to achieve Zen acceptance that we are going to be adding these things, in order to save the projects".
Domas Mituzas, a paid developer for Facebook and a MediaWiki volunteer, spoke on improving performance, especially given the increasingly large number of actions that could seriously slow down larger wikis, e.g. changing widely used templates. A key goal is to reduce the burden of citation processing, which can occupy over 50% of the parsing workload for a given page. Mituzas emphasized the relation between performance and editor retention (sluggish website response times while editing can drive contributors away). Both he and (in the next talk) Tim Starling talked about HipHop integration, with its improvement in performance of up to 400%. It is envisaged to be deployed in late 2011 or early 2012.
Mark Bergsma gave an address (with slides) on Wikimedia's preparations for World IPv6 Day on June 8.
This year's motto was "talk less, code more", and, in addition to the talks, a significant amount of development work ("hacking") was completed: 65 bugs were squashed (see In brief below, for highlights), progress was made on localisation work, and even the start of an attempt at one of the features most requested by Wikimedians, global watchlists. An overhaul of the HTTPS access to Wikimedia sites (outlined last month by Brion Vibber and Ryan Lane) was being worked on.
The three day event also included social elements such as parties and outings. The ongoing success of the Berlin hack-days is almost certain to create interest among other chapters, including the United Kingdom chapter where proposals for a similar event have already been floated.
Further information: Friday blogpost, Friday notes, Saturday blogpost, Saturday notes, Sunday blogpost, Sunday notes, Monday blogpost.
April Engineering Report published
The Foundation's Engineering Report for April was published last week on the Wikimedia Techblog, giving a brief overview of all Foundation-sponsored technical operations in the last month. Given the ten day publication lag this month, the updates provided on many of the major development threads have since been superseded by relevant talks at the Berlin Hackathon (see above). However, the report also gave details on a number of other projects not covered there. For example, it highlighted how work on the mobile projects has "taken off" and the department's involvement in budgeting for the 2011-2012 fiscal year.
Also in the report was news of contractor Russ Nelson's work in improving the media storage and retrieval architecture by deploying new software in the form of "OpenStack Swift", which was deployed on three test servers. In April they started handling a small amount of operations, though the result of this test was not given. The WMF also said that it had received the account details that it needed from Google to start taking up their offer of backup space, helped by a transition to a new, more powerful server from generating the dumps themselves. A new search indexer was installed last month, resolving the space issues experienced with the older server and the Foundation's ability to monitor its uptime was also improved. It wasn't all plain sailing during April, however: according to the report, a click tracking trial with the Article Feedback tool had to be abandoned during the month, as did an upgrade to the Foundation's Squid software (see previous Signpost coverage). Work on keeping the code review backlog down was also unsuccessful, though a specific effort to handle shell bugs proved more effective.
The report added more detail on the new Article Feedback tool, which was recently expanded to cover 100,000 articles on the English Wikipedia, noting that, in a possible evolution of the tool, "users [would] be able to promote particularly relevant reviews to the talk page of the article reviewed" and that the tool would consider the credentials of the reviewer in question. References to a new LiquidThreads timeline were also included with their August release date, along with details of significant work on the project by its sole developer, Andrew Garrett. At the same time, Roan Kattouw and Timo Tijhof started to work on specifications for a version 2 of the ResourceLoader. In addition, following discussions about the deployment of the Interlanguage extension to Wikipedia, designer Brandon Harris made several recommendations to the extension's developers. Also included was news from Wikimedia's commercial partners PediaPress, who have patched the current extension so that generated openZim exports (i.e. articles saved in a format suitable for offline browsing) now have a navigable table of contents. They also report that more than a thousand openZim files were downloaded in just one week of April. In addition, the PoolCounter extension for avoiding repeats of the "Michael Jackson" effect was successfully deployed at the second attempt.
In brief
Not all fixes may have gone live to WMF sites at the time of writing; some may not be scheduled to go live for many weeks.
- At the Berlin Hackathon, developers decided they knew of no remaining performance concerns with the use of email notifications for user talk pages, allowing its use on larger wikis (bug #5220). Because most editors will find it of use, it was enabled by default on the English Wikipedia and other projects, prompting some confusion when Wikipedia emails were sent to users for the first time.
- Sergey Chernyshev was replaced by Tom Gries as the maintainer of the OpenID extension for MediaWiki, an extension still considered a possibility for Wikimedia sites if it proves to be scalable and useful (wikitech-l mailing list).
- With the resolution of bug #28914, bots on the English Wikipedia are now automatically IP block exempt (but not Tor-block exempt).
- A major flaw in the Titleblacklist which prevents "meaningless" names such as Image.jpg was fixed. It prevented Internet Explorer users from uploading files (bug #28918).
- Bug #24037 was closed, with a tentative number of bytes added or removed indicator appended to Special:Contributions.
- The website Embedly, a "platform for converting URLs into embeddable content" now returns thumbnails for images on Wikipedia (example) and Wikimedia Commons (example), making it easier to reuse content.
- The "Your changes" box seen during edit conflicts is now read-only, preventing users from mistaking it for the box they should be typing in (bug #28287).
Reader comments