(anonymous guest) (logged out)

Copyright (C) by the contributors. Some rights reserved, license BY-SA.

Sponsored by the Wiki Symposium and the Nuveon GmbH.

 

Add new attachment

Only authorized users are allowed to upload new attachments.

This page (revision-31) was last changed on 19-Oct-2007 00:37 by 207.171.180.101  

This page was created on 22-Feb-2007 08:25 by ChristophSauer

Only authorized users are allowed to rename pages.

Only authorized users are allowed to delete pages.

Difference between version and

At line 81 changed one line
[#1] For reference, [[RadomirDopieralski]] wrote the following on [[Talk.Require Space After Bullet Proposal]] on 2007-02-09:
[#1] For reference, [[RadomirDopieralski]] wrote the following on [[Talk.Require Space After Bullet Proposal]] on 2007-02-09: [[BulletListOnWikipediaResearch]]
At line 83 removed 19 lines
I did a little experiment: I downloaded the backup of the english wikipedia's all pages, and looked at the percentages of both styles of 1st level lists in them. Unfortunately, I was able to only extract about 6.3GB of text, as I ran out of disk space. Anyways, I hope that the sampling is not biased because of that.
In the sample I checked there are {{{1 763 983}}} first level list items with a letter (a-z, A-Z, 0-9) immediately following the asterisk. The average length of these items is 90.2 characters or 12.3 words. 80% of them didn't have a space in front of the bullet too.
There are {{{4 863 709}}} first level list items with a space or tab immediately after the asterisk. The average length of them is 81 characters or 10 words.
There are also {{{5 381 592}}} first level list items with neither a space or a letter right after the bullet (nor an asterisk, of course). 25% of them were lists starting with bold or italic text.
This means, that over 26% of list items start with a letter immediately after the bullet, and over 57% of 1st level list items didn't have a space after the bullet. This is an enexpectedly high result.
I didn't mean to count the average length of the entries, but I used {{{wc}}} without any parameters, so this data came for free. I found it interesting that spaceless items are on average longer than the "spaced" ones. I went to several randomly picked pages, and checked their history. It turns out that the list items wereinitially paragraphs, but somebody decided that they look better with a dot in front of them, so he went trough the source and added an asterisk at the beginning of every paragraph. I don't know in how many cases it was what happened, but one is sure -- the experienced users will use the minimal markup that works -- especially when reformatting existing text.
Now the results for lists with higher nesting level than one:
* 657078 list items without a space
* 389956 list items with a space
* 62% of 2nd and higher level list items without a space after the bullets
Honestly, I don't really know what that means :)
Version Date Modified Size Author Changes ... Change note
31 19-Oct-2007 00:37 7.075 kB 207.171.180.101 to previous Fixed grammar
30 19-Oct-2007 00:36 7.071 kB 207.171.180.101 to previous | to last Fixed grammar
29 19-Oct-2007 00:33 7.069 kB 207.171.180.101 to previous | to last Fixed spelling and grammar
28 26-Sep-2007 09:46 7.065 kB ChuckSmith to previous | to last restore
27 26-Sep-2007 01:53 7.098 kB 203.69.39.251 to previous | to last
26 26-Sep-2007 01:52 7.078 kB 60.250.153.11 to previous | to last
25 26-Apr-2007 08:23 7.065 kB ChristophSauer to previous | to last added link to proof of conept
24 23-Apr-2007 16:21 6.935 kB ChristophSauer to previous | to last typo
23 21-Apr-2007 15:41 6.938 kB ChristophSauer to previous | to last removed mistycal software -> factored out
22 20-Apr-2007 10:16 9.104 kB ChristophSauer to previous | to last Mystical: I was wrong, the werewolf exists.
21 19-Apr-2007 21:57 8.241 kB GregorHagedorn to previous | to last Mystic software...
« This page (revision-31) was last changed on 19-Okt-2007 00:37 by 207.171.180.101