(anonymous guest) (logged out)

Copyright (C) by the contributors. Some rights reserved, license BY-SA.

Sponsored by the Wiki Symposium and the Nuveon GmbH.

 

Add new attachment

Only authorized users are allowed to upload new attachments.

This page (revision-31) was last changed on 19-Oct-2007 00:37 by 207.171.180.101  

This page was created on 22-Feb-2007 08:25 by ChristophSauer

Only authorized users are allowed to rename pages.

Only authorized users are allowed to delete pages.

Difference between version and

At line 1 added 2 lines
Take a look at how the [[HyphenListMarkupProposalWorks]], it is a proof of concept page for demonstration purposes.
-----------
At line 7 changed one line
Considering the facts behind 25% of list items on Wikipedia not using space after the star **and** 25% of lists have items starting with italics or bold[1], this combination is not an [[EdgeCase]]. This means that proposals like [[RequireSpaceAfterBulletProposal]] would make it more difficult to use Creole than necessary. Proposals like this are a sign that something in the design is wrong and accepting this probosal would only cure a symptom. That's why we ask the comunity to accept this new proposal to switch back to the original design on using hyphens for unnumbered lists.
Considering the facts behind 25% of list items on Wikipedia not using space after the star **and** 25% of lists have items starting with italics or bold[1], this combination is not an [[EdgeCase]]. This means that proposals like [[RequireSpaceAfterBulletProposal]] would make it more difficult to use Creole than necessary. Proposals like this are a sign that something in the design is wrong and accepting this proposal would only cure a symptom. That's why we ask the comunity to accept this new proposal to switch back to the original design on using hyphens for unnumbered lists.
At line 25 changed one line
** common use of double-hyphen to indicate m-dash in text, although this is not really a disadvantage because m-dashes are **never** used at the beginning of a line of text. (there's a distinction between [[hyphens and dashes]])
** common use of double-hyphen to indicate m-dash in text, although this is not really a disadvantage because m-dashes are __never__ used at the beginning of a line of text. (there's a distinction between [[hyphens and dashes]])
At line 30 changed one line
* usually the first guess of endusers, (look into your mailbox for the usage of lists, see [[Line Break Case Studies]])
* usually the first guess of end users, (look into your mailbox for the usage of lists, see [[Line Break Case Studies]])
At line 39 changed one line
In the following section we will show solutions for the abiguities already discussed on the [[Talk.Lists]] page involved with hyphens. Opposed to the //bold as the first formatting in a list item//, we consider these combinations as root for ambiguities as [[EdgeCases]].
In the following section we will show solutions for the ambiguities already discussed on the [[Talk.Lists]] page involved with hyphens. Opposed to the //bold as the first formatting in a list item//, we consider these combinations as root for ambiguities as [[EdgeCases]].
At line 43 changed one line
In general a unnumbered list alway has to start with **one** hyphen.
In general an unnumbered list alway has to start with **one** hyphen.
At line 68 changed one line
a equal sign as the first character. While we do not loose a word about the headings so far there has been
an equal sign as the first character. While we do not lose a word about the headings so far there has been
At line 78 changed one line
If we would use space as a escape character we would not be able to allow space in front of lists for example anymore.
If we would use space as an escape character we would not be able to allow space in front of lists for example anymore.
At line 82 added one line
== Remaining Problems Identified in Discussions
At line 81 changed one line
[#1] For reference, [[RadomirDopieralski]] wrote the following on [[Talk.Require Space After Bullet Proposal]] on 2007-02-09:
=== Hard line breaks
At line 83 changed one line
I did a little experiment: I downloaded the backup of the english wikipedia's all pages, and looked at the percentages of both styles of 1st level lists in them. Unfortunately, I was able to only extract about 6.3GB of text, as I ran out of disk space. Anyways, I hope that the sampling is not biased because of that.
A hard line break is a line break that is saved explicitly in the text by putting in a line break character, that is not visible to human readers, but will be used by parsers. A human user inserts this invisible character when he hits return. Some Software products also might insert hard line breaks when they wrap the line, see [[Hard Line Breaks]] for a list.
At line 85 changed one line
In the sample I checked there are {{{1 763 983}}} first level list items with a letter (a-z, A-Z, 0-9) immediately following the asterisk. The average length of these items is 90.2 characters or 12.3 words. 80% of them didn't have a space in front of the bullet too.
The [[ListMarkupLinebreakArgument]] shows that there remains the issue that users, and worse, software, might do hard line breaks before hyphens. This is not a problem special to list markup with hyphens, this also affects all other markup characters that indicate special meaning at the beginning of a line.
At line 87 changed one line
There are {{{4 863 709}}} first level list items with a space or tab immediately after the asterisk. The average length of them is 81 characters or 10 words.
{{{
**START** I think these are realistic examples
- either using simple hyphens as here - or the
-- much nicer -- n/m-dash way of writing.
And the minus sign (like
-1) even must be placed there. Finally, whereas
in English only trailing hyphen-constructs like
sub- and superclass are common, in German also
leading hyphen-constructs occur, e.g. subclass and
-property. **END**
}}}
At line 89 removed one line
There are also {{{5 381 592}}} first level list items with neither a space or a letter right after the bullet (nor an asterisk, of course). 25% of them were lists starting with bold or italic text.
At line 91 changed one line
This means, that over 26% of list items start with a letter immediately after the bullet, and over 57% of 1st level list items didn't have a space after the bullet. This is an enexpectedly high result.
Here's an example text, using equal signs (headings) and number signs (numbered list).
At line 93 changed one line
I didn't mean to count the average length of the entries, but I used {{{wc}}} without any parameters, so this data came for free. I found it interesting that spaceless items are on average longer than the "spaced" ones. I went to several randomly picked pages, and checked their history. It turns out that the list items wereinitially paragraphs, but somebody decided that they look better with a dot in front of them, so he went trough the source and added an asterisk at the beginning of every paragraph. I don't know in how many cases it was what happened, but one is sure -- the experienced users will use the minimal markup that works -- especially when reformatting existing text.
{{{
**START** I think these are also realistic
examples using #1 equals signs, or #2 number
signs in normal text. Where #2 might be frequent,
#1 not as frequent. Also someone could use
equals signs in texts. Since we allow one
= equal sign to be a valid header. Therefore
this also causes trouble. So someone could use
an example in an exercise where he uses
-2 as some value and tells someone should multiply
it with a value so that the end result
=4. Someone also might use stars as regular
footnote markers, etc.. **END**
}}}
At line 95 changed one line
Now the results for lists with higher nesting level than one:
It is true, that the first example with hyphens is more frequent than the second example.
At line 97 changed 3 lines
* 657078 list items without a space
* 389956 list items with a space
* 62% of 2nd and higher level list items without a space after the bullets
=== Implications
A user that breaks the line hard (hitting return) has to be aware that he should not use markup characters as the first character in the next line, that have meaning as a first character in a line (no matter if this proposal will be accepted or not). You should not use software that is breaking the line hard, when working with wiki markup.
At line 101 changed one line
Honestly, I don't really know what that means :)
* Equal Signs (Headings)
* Number Signs (Numbered Lists)
* Hyphens (Unnumbered Lists, if this proposal is accepted)
* ..
----------
[#1] For reference, [[RadomirDopieralski]] wrote the following on [[Talk.Require Space After Bullet Proposal]] on 2007-02-09: [[BulletListOnWikipediaResearch]]
Version Date Modified Size Author Changes ... Change note
31 19-Oct-2007 00:37 7.075 kB 207.171.180.101 to previous Fixed grammar
30 19-Oct-2007 00:36 7.071 kB 207.171.180.101 to previous | to last Fixed grammar
29 19-Oct-2007 00:33 7.069 kB 207.171.180.101 to previous | to last Fixed spelling and grammar
28 26-Sep-2007 09:46 7.065 kB ChuckSmith to previous | to last restore
27 26-Sep-2007 01:53 7.098 kB 203.69.39.251 to previous | to last
26 26-Sep-2007 01:52 7.078 kB 60.250.153.11 to previous | to last
25 26-Apr-2007 08:23 7.065 kB ChristophSauer to previous | to last added link to proof of conept
24 23-Apr-2007 16:21 6.935 kB ChristophSauer to previous | to last typo
23 21-Apr-2007 15:41 6.938 kB ChristophSauer to previous | to last removed mistycal software -> factored out
22 20-Apr-2007 10:16 9.104 kB ChristophSauer to previous | to last Mystical: I was wrong, the werewolf exists.
21 19-Apr-2007 21:57 8.241 kB GregorHagedorn to previous | to last Mystic software...
« This page (revision-31) was last changed on 19-Okt-2007 00:37 by 207.171.180.101