all messages for Emacs-related lists mirrored at yhetil.org
 help / color / mirror / code / Atom feed
* Efficiency of Org v. LaTeX v. Word
@ 2014-12-26 22:47 Ken Mankoff
  2014-12-26 23:36 ` Thomas S. Dye
                   ` (7 more replies)
  0 siblings, 8 replies; 21+ messages in thread
From: Ken Mankoff @ 2014-12-26 22:47 UTC (permalink / raw)
  To: Org-mode mailing list


People here might be interested in a publication from [2014-12-19 Fri]
available at http://dx.doi.org/10.1371/journal.pone.0115069

Title: An Efficiency Comparison of Document Preparation Systems Used
in Academic Research and Development

Summary: Word users are more efficient and have less errors than even
experienced LaTeX users.

Someone here should repeat experiment and add Org into the mix, perhaps
Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
pandoc) beat straight Word?

  -k.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
@ 2014-12-26 23:36 ` Thomas S. Dye
  2014-12-27  2:21   ` briangpowell .
  2014-12-27  3:26 ` Christopher W. Ryan
                   ` (6 subsequent siblings)
  7 siblings, 1 reply; 21+ messages in thread
From: Thomas S. Dye @ 2014-12-26 23:36 UTC (permalink / raw)
  To: Ken Mankoff; +Cc: Org-mode mailing list


,----------------------------------------------------------------------
| "One may also argue that given a well-designed LaTeX document class  
| file, document development speed and text and formatting accuracy are
| significantly improved."                                             
`----------------------------------------------------------------------

Apparently, the LaTeX users didn't have the benefit of a document
class.  Hard to take a "study" like this seriously.

,-----------------------------------------------------------------------
| "preventing researchers from producing documents in LaTeX would save  
| time and money to maximize the benefit of research and development for
| both the research team and the public"                                
`-----------------------------------------------------------------------

All you have to lose is your freedom.

All the best,
Tom

Ken Mankoff <mankoff@gmail.com> writes:

> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
>
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
>
>   -k.
>
>
>

-- 
Thomas S. Dye
http://www.tsdye.com

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 23:36 ` Thomas S. Dye
@ 2014-12-27  2:21   ` briangpowell .
  2014-12-27 14:36     ` Eric S Fraga
  0 siblings, 1 reply; 21+ messages in thread
From: briangpowell . @ 2014-12-27  2:21 UTC (permalink / raw)
  To: Thomas S. Dye; +Cc: Org-mode mailing list, Ken Mankoff

[-- Attachment #1: Type: text/plain, Size: 2726 bytes --]

Word is a desktop publishing system.

LaTeX is a macro language which lays on top of TeX=Tau-Epsilon-Chi~Art in
Greek

TeX is computerized typesetting that enables vector graphics--you can get
TeX to draw anything you want--you can even create your own font.

More Math journals and books you'd find in the library are created using
TeX than any other software system.

The poor kerning and severe limitations of Word are too many to number here.

Word is in a different class of software, the 2 aren't comparable at all.

Word is a poor WYSIWYG software package that is good for low quality
desktop publishing, team collaboration but can be programmed and interacted
with through VB--its useful to the general public.

LaTeX provides precision and expression; there are things you can do with
TeX that aren't possible with Word.

Members of the Free Software community (which TeX has always been a part
of) will never bow down to the Micro$oft tyranny which is so evil words
can't express the depths of their corruption--the comparison is absurd.




On Fri, Dec 26, 2014 at 6:36 PM, Thomas S. Dye <tsd@tsdye.com> wrote:

>
> ,----------------------------------------------------------------------
> | "One may also argue that given a well-designed LaTeX document class
> | file, document development speed and text and formatting accuracy are
> | significantly improved."
> `----------------------------------------------------------------------
>
> Apparently, the LaTeX users didn't have the benefit of a document
> class.  Hard to take a "study" like this seriously.
>
> ,-----------------------------------------------------------------------
> | "preventing researchers from producing documents in LaTeX would save
> | time and money to maximize the benefit of research and development for
> | both the research team and the public"
> `-----------------------------------------------------------------------
>
> All you have to lose is your freedom.
>
> All the best,
> Tom
>
> Ken Mankoff <mankoff@gmail.com> writes:
>
> > People here might be interested in a publication from [2014-12-19 Fri]
> > available at http://dx.doi.org/10.1371/journal.pone.0115069
> >
> > Title: An Efficiency Comparison of Document Preparation Systems Used
> > in Academic Research and Development
> >
> > Summary: Word users are more efficient and have less errors than even
> > experienced LaTeX users.
> >
> > Someone here should repeat experiment and add Org into the mix, perhaps
> > Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> > Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> > pandoc) beat straight Word?
> >
> >   -k.
> >
> >
> >
>
> --
> Thomas S. Dye
> http://www.tsdye.com
>
>

[-- Attachment #2: Type: text/html, Size: 3793 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
  2014-12-26 23:36 ` Thomas S. Dye
@ 2014-12-27  3:26 ` Christopher W. Ryan
  2014-12-28 22:45   ` Bob Newell
  2014-12-27  4:27 ` Nick Dokos
                   ` (5 subsequent siblings)
  7 siblings, 1 reply; 21+ messages in thread
From: Christopher W. Ryan @ 2014-12-27  3:26 UTC (permalink / raw)
  To: Org-mode mailing list

This seems like more of a typing contest than anything else. Reproducing
a single page of an already-typeset document is not what LaTeX is
designed for, nor is it what scientists do for a living. The test
selections were absurdly short relative to the typical scientic
manuscript. Long and complex documents are where LaTeX excels. And this
did not call upon some of the most important (IMHO) capabilities of
LaTeX: managing citations with BibTex; changing the style to suit
different journals; storing, revisiting, and reusing your document years
later.

--Chris Ryan

Ken Mankoff wrote:
> 
> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
> 
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
> 
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
> 
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
> 
>   -k.
> 
> 

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
  2014-12-26 23:36 ` Thomas S. Dye
  2014-12-27  3:26 ` Christopher W. Ryan
@ 2014-12-27  4:27 ` Nick Dokos
  2014-12-27  9:06   ` Peter Neilson
  2014-12-27  9:48 ` Achim Gratz
                   ` (4 subsequent siblings)
  7 siblings, 1 reply; 21+ messages in thread
From: Nick Dokos @ 2014-12-27  4:27 UTC (permalink / raw)
  To: emacs-orgmode

Ken Mankoff <mankoff@gmail.com> writes:

> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
>
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
>

The "study" is deeply flawed: Word users typed more text in 30 minutes
than LaTeX users? 20% more? For straight text? I don't believe it: I
think it's much more likely that the Word users were better typists on
average and I didn't see any mention of normalizing the results by
taking that into account. And for LaTeX, the editing environment is
of paramount importance: did they mostly use vi, emacs, emacs+auctex?
Other than a couple of vague sentences in the "Discussion" section,
there is no mention of how this variable was (or was not) controlled.
And did the LaTeX users have to type the preamble or were they allowed
to use a template? It just seems unbelievable that there is such
a big difference for straight text.

I can believe perhaps that typing a table by hand into LaTeX is more
error prone than typing it into Word (although to be honest, I have
never done the latter, so I don't really know). With org and radio
tables, this would be a non-issue on the LaTeX side.

As for equations, even the authors admit that LaTeX is better, although
they tend to minimize the differences as statistically insignificant (at
least between the expert classes), which strikes me as somewhat suspect
as well: there seems to be a 10% difference between the expert user
averages and a bigger one for novices, although the error bars might
overlap in the first case (although they don't look it). I don't think
that even that would make the difference insignificant, but we'd have to
analyze their raw results to make sure (which they do provide and which
I took a look at, but afaict they don't provide answers to the questions
I raised above; maybe we should suggest that the authors use org and
reproducible results methods).

Anyway, color me deeply suspicious of the "study". 

-- 
Nick

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27  4:27 ` Nick Dokos
@ 2014-12-27  9:06   ` Peter Neilson
  2014-12-27 14:38     ` Eric S Fraga
  0 siblings, 1 reply; 21+ messages in thread
From: Peter Neilson @ 2014-12-27  9:06 UTC (permalink / raw)
  To: emacs-orgmode

On Fri, 26 Dec 2014 23:27:37 -0500, Nick Dokos <ndokos@gmail.com> wrote:

> Anyway, color me deeply suspicious of the "study".

Indeed!

The study touches only a few of the inherent difficulties in document  
production. Its major flaw is that it draws any conclusions at all  
recommending that authors produce documents one way or another. Personally  
I am always disappointed when someone requests a document in MS Word  
format, because that means I'll have to fire up Libre Office and shove my  
text through it, rather than using whatever other system I happen to have  
been using. I do not believe that I currently own a system with genuine MS  
Word.

As well as having insufficient control of variables, and a flawed  
understanding of what is involved in "document preparation," the study  
also has a marginally small sample size. Any study for any purpose that  
presents "statistics" with sample sizes smaller than 30 is immediately  
suspect. I won't even begin to address the misinterpretation of  
correlation as causation that appears in the "softer" sciences, nor their  
necessity for sample sizes far larger than 100, nor the tendency in some  
fields to mistake a time series as a set of samples.

MS Word works extremely well for "one-off" small papers. Little investment  
of effort is required for a naive person to produce adequate results, and  
as every user of emacs knows, that's pretty much the opposite of emacs.

On the other hand, MS Word has historically been a terrible tool for  
producing large documents, or documents that are to be maintained by a  
group of people, or over several years or decades. Handling Word's "Master  
Document" provision without being crippled by corrupted documents is an  
art form unto itself. The standard advice among experienced users of Word  
has always been, "Don't Use Master Documents!" When a group of people are  
all editing versions of a document, any attempt to use standard formatting  
in Word requires substantial effort to prevent naive contributers from  
reformatting outside the established styles, or even breaking all the  
styles. Furthermore, Word documents are in general not amenable to  
incremental version control as commonly used by coding teams.

My conclusions? If your paper is trivial and you are under pressure to  
produce it quickly, then MS Word might be the best tool. Established  
journals should attempt to allow contributions in more than one format,  
and restriction to MS Word format is a bad idea, no matter how much some  
people like the apparent ease-of-use that MS Word provides. Attempting to  
extend the "study" to include org mode would be a waste of effort.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
                   ` (2 preceding siblings ...)
  2014-12-27  4:27 ` Nick Dokos
@ 2014-12-27  9:48 ` Achim Gratz
  2014-12-27 10:05 ` Paul Rudin
                   ` (3 subsequent siblings)
  7 siblings, 0 replies; 21+ messages in thread
From: Achim Gratz @ 2014-12-27  9:48 UTC (permalink / raw)
  To: emacs-orgmode

Am 26.12.2014 um 23:47 schrieb Ken Mankoff:
> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.

The way researcher efficiency is defined in that "study" completely 
misses the purpose of scientific publishing and it goes downhill from 
there.  The statistics are pseudo-scientific smokes and mirrors, no 
control groups, no normalization and not a single hint of why it should 
be acceptable to use normal distributions for something that clearly 
isn't normally distributed other than the obvious convenience of drawing 
wild conclusions from a small sample size.

I'm still not sure if this isn't an elaborate joke, but I'm afraid not.

> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?

Repeating a deeply flawed "study" that seems designed to support some 
pre-conceived notion or preference of the authors isn't going to produce 
any new insights and I'm quite certain that there is better research 
into the differences of WYSIWIG vs. non-WYSIWIG publication systems 
and/or researcher efficiency.  If a reasearcher is nothing more than a 
typist that needs to produce pages of texts, tables and equations in a 
prescribed format in the least amount of time motivated by a monetary 
prize, we wouldn't need researchers at all.  That would incidentally 
save much more money than having them all switch from LaTeX to Word, so 
let's stop funding research.


-- 
Achim.

(on the road :-)

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
                   ` (3 preceding siblings ...)
  2014-12-27  9:48 ` Achim Gratz
@ 2014-12-27 10:05 ` Paul Rudin
  2014-12-27 10:36   ` M
  2014-12-27 13:37 ` Daniele Pizzolli
                   ` (2 subsequent siblings)
  7 siblings, 1 reply; 21+ messages in thread
From: Paul Rudin @ 2014-12-27 10:05 UTC (permalink / raw)
  To: emacs-orgmode-mXXj517/zsQ

Ken Mankoff <mankoff-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
>
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
>

No mention of emacs... who uses anything else to prepare their LaTeX?

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27 10:05 ` Paul Rudin
@ 2014-12-27 10:36   ` M
  2014-12-27 11:36     ` Fabrice Popineau
  2014-12-31 18:19     ` Paul Rudin
  0 siblings, 2 replies; 21+ messages in thread
From: M @ 2014-12-27 10:36 UTC (permalink / raw)
  To: emacs orgmode-mailinglist; +Cc: Paul Rudin




> Von: Paul Rudin <paul@rudin.co.uk>
> Datum: Sat, 27 Dec 2014 10:05:19 +0000
> An: <emacs-orgmode@gnu.org>
> Betreff: Re: [O] Efficiency of Org v. LaTeX v. Word
> 
> Ken Mankoff <mankoff@gmail.com> writes:
> 
>> People here might be interested in a publication from [2014-12-19 Fri]
>> available at http://dx.doi.org/10.1371/journal.pone.0115069
>> 
>> Title: An Efficiency Comparison of Document Preparation Systems Used
>> in Academic Research and Development
>> 
>> Summary: Word users are more efficient and have less errors than even
>> experienced LaTeX users.
>> 
>> Someone here should repeat experiment and add Org into the mix, perhaps
>> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
>> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
>> pandoc) beat straight Word?
>> 
> 
> No mention of emacs... who uses anything else to prepare their LaTeX?
> 
Did you forget the " ;-)" or are you serious?

Emacs is for sure a very good one, but there are a lot of popular
alternatives, if you have a look at the (for sure not representative) voting
on the answers of this discussion here:

http://tex.stackexchange.com/questions/339/latex-editors-ides

(It's clear, that people may have voted for several of those editors, so
that no valid statistics at all, but at least an idea...)

Is there any real survey result about which editors LaTeX users use?

Martin

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27 10:36   ` M
@ 2014-12-27 11:36     ` Fabrice Popineau
  2014-12-28 22:43       ` Pascal Fleury
  2014-12-31 18:19     ` Paul Rudin
  1 sibling, 1 reply; 21+ messages in thread
From: Fabrice Popineau @ 2014-12-27 11:36 UTC (permalink / raw)
  To: M; +Cc: Paul Rudin, emacs orgmode-mailinglist

[-- Attachment #1: Type: text/plain, Size: 3744 bytes --]

I agree that this study is certainly not large enough to draw strong
conclusions, but it raises a couple of questions
and some points may require attention.

I have spent many years in the TeX world. I see how lots of people use TeX
: students, professionals, researchers etc...
and I would easily draw 2 categories of people :
- those who are programmers "in their soul" (DEK once said that 2% or so of
the whole human race is gifted with programming, the same way some people
are gifted to play music etc.)
- those who use LaTeX "because it is the best typesetting system"
People who belong to the intersection of those 2 categories will certainly
be very efficient in producing documents with LaTeX, much more than what
this study shows.
But people from the first category may also be efficient in producing
documents with Word (Word is programmable too and the typesetting engine is
fancier than most people would believe).
The real problem is the guys from the second category who stick to use a
tool they are not comfortable with but they don't want to admit it.
Over the last years, I have seen more and more students come with LaTeX
documents which had a very poor appearance.
There has been a lot of pressure with the rise of Linux to use LaTeX.
Unfortunately the results of using LaTeX may not be up to the expectations.
The tool is too complex. It can produce beautiful documents when used
right, but it can also easily produce awful documents.
You can also spend a lot of time in fixing details, and it happens more
frequently than even proficient LaTeX users would admit.
In the end, I think the tendency is to a growing number of LaTeX users who
use it poorly.

Finally, today, my experience is that publishers charge much more for LaTeX
documents than for Word (or similar tools) documents and they are reluctant
to use LaTeX because of its complexity.

That was my $0.02

Fabrice

2014-12-27 11:36 GMT+01:00 M <Elwood151@web.de>:

> > Von: Paul Rudin <paul@rudin.co.uk>
> > Datum: Sat, 27 Dec 2014 10:05:19 +0000
> > An: <emacs-orgmode@gnu.org>
> > Betreff: Re: [O] Efficiency of Org v. LaTeX v. Word
> >
> > Ken Mankoff <mankoff@gmail.com> writes:
> >
> >> People here might be interested in a publication from [2014-12-19 Fri]
> >> available at http://dx.doi.org/10.1371/journal.pone.0115069
> >>
> >> Title: An Efficiency Comparison of Document Preparation Systems Used
> >> in Academic Research and Development
> >>
> >> Summary: Word users are more efficient and have less errors than even
> >> experienced LaTeX users.
> >>
> >> Someone here should repeat experiment and add Org into the mix, perhaps
> >> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> >> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> >> pandoc) beat straight Word?
> >>
> >
> > No mention of emacs... who uses anything else to prepare their LaTeX?
> >
> Did you forget the " ;-)" or are you serious?
>
> Emacs is for sure a very good one, but there are a lot of popular
> alternatives, if you have a look at the (for sure not representative)
> voting
> on the answers of this discussion here:
>
> http://tex.stackexchange.com/questions/339/latex-editors-ides
>
> (It's clear, that people may have voted for several of those editors, so
> that no valid statistics at all, but at least an idea...)
>
> Is there any real survey result about which editors LaTeX users use?
>
> Martin
>
>
>
>


-- 
Fabrice Popineau
-----------------------------
SUPELEC
Département Informatique
3, rue Joliot Curie
91192 Gif/Yvette Cedex
Tel direct : +33 (0) 169851950
Standard : +33 (0) 169851212
------------------------------

[-- Attachment #2: Type: text/html, Size: 4980 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
                   ` (4 preceding siblings ...)
  2014-12-27 10:05 ` Paul Rudin
@ 2014-12-27 13:37 ` Daniele Pizzolli
  2014-12-28 21:40 ` Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA! Christophe Pouzat
  2015-01-04 20:38 ` Efficiency of Org v. LaTeX v. Word John Kitchin
  7 siblings, 0 replies; 21+ messages in thread
From: Daniele Pizzolli @ 2014-12-27 13:37 UTC (permalink / raw)
  To: Org-mode mailing list

Hello,

Ken Mankoff writes:

> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.

As other said, the efficiency in the paper is about the manual copy of a
small portion of text, tables...  This is a little bit different to
publish a research, maybe a reproducible one, with the help of a team.

> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?

Repeating a flawed experiment do not add a lot of value...

Best,
Daniele

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27  2:21   ` briangpowell .
@ 2014-12-27 14:36     ` Eric S Fraga
  0 siblings, 0 replies; 21+ messages in thread
From: Eric S Fraga @ 2014-12-27 14:36 UTC (permalink / raw)
  To: Org-mode mailing list

On Friday, 26 Dec 2014 at 21:21, briangpowell . wrote:

[...]

> Word is in a different class of software, the 2 aren't comparable at all.

Indeed.  Nonsense article, in my opinion.  Comparing apples and oranges.

In any case, as my writing is very highly equation based, I think I'll
stick to LaTeX...

(using org, of course)

-- 
: Eric S Fraga (0xFFFCF67D), Emacs 25.0.50.1, Org release_8.3beta-581-g0e52f0

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27  9:06   ` Peter Neilson
@ 2014-12-27 14:38     ` Eric S Fraga
  0 siblings, 0 replies; 21+ messages in thread
From: Eric S Fraga @ 2014-12-27 14:38 UTC (permalink / raw)
  To: Peter Neilson; +Cc: emacs-orgmode

On Saturday, 27 Dec 2014 at 04:06, Peter Neilson wrote:

[...]

> My conclusions? If your paper is trivial and you are under pressure to
> produce it quickly, then MS Word might be the best tool.

Actually, I don't think I can get any faster than using org for a
trivial paper needed quickly...  I'm in emacs already ;-)
-- 
: Eric S Fraga (0xFFFCF67D), Emacs 25.0.50.1, Org release_8.3beta-581-g0e52f0

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA!
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
                   ` (5 preceding siblings ...)
  2014-12-27 13:37 ` Daniele Pizzolli
@ 2014-12-28 21:40 ` Christophe Pouzat
  2014-12-29 19:47   ` Thomas S. Dye
  2014-12-31 16:59   ` Colin Baxter
  2015-01-04 20:38 ` Efficiency of Org v. LaTeX v. Word John Kitchin
  7 siblings, 2 replies; 21+ messages in thread
From: Christophe Pouzat @ 2014-12-28 21:40 UTC (permalink / raw)
  To: emacs-orgmode

[-- Attachment #1: Type: text/plain, Size: 13854 bytes --]

Hi all,

After seeing Ken's mail:

Le 26/12/2014 23:47, Ken Mankoff a écrit :
> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
>
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
>
>    -k.
>
>
and some of replies it triggered on the list, I went to check the paper. 
As many of you guys I found some "results" puzzling in particular:
1. the use of bar graphs when the data would better be displayed 
directly (that qualifies immediately the paper as "low quality" for me).
2. the larger error bars observed for LaTeX when compared to Word.
3. the systematic inverse relationship between the blue and pink bars 
heights.

So I went to figshare to download the data and looked at them. A quick 
and dirty "analysis" is attached to this mail in PDF format (generated 
with org, of course, and this awful software called LaTeX!) and the 
source org file can be found at the bottom of this mail. I used R to do 
the figures (and I'm sure the authors of the paper will then criticize 
me for not using Excel with which everyone knows errors are generated 
much more efficiently).

I managed to understand the inverse relationship in point 3 above: the 
authors considered 3 types of mistakes / errors:
1. Formatting and typos error.
2. Orthographic and grammatical errors.
3. Missing words and signs.
Clearly, following the mail of Tom (Dye) on the list and on the Plos web 
site, I would argue that formatting errors in LaTeX are bona fide bugs. 
But the point I want to make is that the third source accounts for 80% 
of the total errors (what's shown in pink bars in the paper) and clearly 
the authors counted what the subjects did not have time to type as an 
error of this type. Said differently, the blue and pink bars are showing 
systematically the same thing by construction! The second type of error 
in not a LaTeX issue (and in fact does not differ significantly from the 
Word case) but an "environment" issue (what spelling corrector had the 
LaTeX users access to?).

There is another strange thing in the table copy case. For both the 
expert and novice group in LaTeX, there is one among 10 subjects that 
did produce 0% of the table but still manage to produce 22 typographic 
errors!

The overall worst performance of LaTeX users remains to be explained and 
as mentioned in on the mails in the list, that does not make sense at 
least for the continuous text exercise. The method section of the paper 
is too vague but my guess is that some LaTeX users did attempt to 
reproduce the exact layout of the text they had to copy, something LaTeX 
is definitely not design to provide quickly.

One more point: how many of you guys could specify their total number of 
hours of experience with LaTeX (or any other software you are currently 
using)? That what the subjects of this study had to specify...

Let me know what you think,

Christophe

My org buffer:

#+TITLE: An Efficiency Comparison of Document Preparation Systems Used 
in Academic Research and Development: A Re-analysis.
#+DATE: <2014-12-28 dim.>
#+AUTHOR: Christophe Pouzat
#+EMAIL: christophe.pouzat@gmail.com
#+OPTIONS: ':nil *:t -:t ::t <:t H:3 \n:nil ^:t arch:headline
#+OPTIONS: author:t c:nil creator:comment d:(not "LOGBOOK") date:t
#+OPTIONS: e:t email:nil f:t inline:t num:t p:nil pri:nil stat:t
#+OPTIONS: tags:t tasks:t tex:t timestamp:t toc:nil todo:t |:t
#+CREATOR: Emacs 24.4.1 (Org mode 8.2.10)
#+DESCRIPTION:
#+EXCLUDE_TAGS: noexport
#+KEYWORDS:
#+LANGUAGE: en
#+SELECT_TAGS: export
#+LaTeX_HEADER: \usepackage{alltt}
#+LaTeX_HEADER: \usepackage[usenames,dvipsnames]{xcolor}
#+LaTeX_HEADER: \renewenvironment{verbatim}{\begin{alltt} \scriptsize 
\color{Bittersweet} \vspace{0.2cm} }{\vspace{0.2cm} \end{alltt} 
\normalsize \color{black}}
#+LaTeX_HEADER: \definecolor{lightcolor}{gray}{.55}
#+LaTeX_HEADER: \definecolor{shadecolor}{gray}{.85}
#+LaTeX_HEADER: \usepackage{minted}
#+LaTeX_HEADER: \hypersetup{colorlinks=true}

#+NAME: org-latex-set-up
#+BEGIN_SRC emacs-lisp :results silent :exports none
(setq org-latex-listings 'minted)
(setq org-latex-minted-options
       '(("bgcolor" "shadecolor")
     ("fontsize" "\\scriptsize")))
(setq org-latex-pdf-process
       '("pdflatex -shell-escape -interaction nonstopmode 
-output-directory %o %f"
     "biber %b"
     "pdflatex -shell-escape -interaction nonstopmode -output-directory 
%o %f"
     "pdflatex -shell-escape -interaction nonstopmode -output-directory 
%o %f"))
#+END_SRC

* Introduction
This is a re-analysis of the data presented in 
[[http://dx.doi.org/10.1371/journal.pone.0115069][An Efficiency 
Comparison of Document Preparation Systems Used in Academic Research and 
Development]]. My "interest" in this paper was triggered by a discussion 
on the [[http://article.gmane.org/gmane.emacs.orgmode/93655][emacs org 
mode mailing list]]. Ignoring the "message" of the paper, what stroke me 
was the systematic use of bar graphs: a way of displaying data that 
*should never be used* since when many observations are considered, a 
box plot is going to do a much better job and when, like in the present 
paper, few (10 in each of the 4 categories) observations are available, 
a direct display or even a simple table is going to do a *much better* 
job. Since it turns out that the data are available both on the Plos web 
site and on 
[[http://figshare.com/articles/_An_Efficiency_Comparison_of_Document_Preparation_Systems_Used_in_Academic_Research_and_Development_/1275631][figshare]], 
I decided to re-analyze them.

* Getting the data, etc.

We get the data with:

#+BEGIN_SRC sh
wget http://files.figshare.com/1849394/S1_Materials.xlsx
#+END_SRC

#+RESULTS:
Using for instance [[http://dag.wiee.rs/home-made/unoconv/][unoconv]], 
we can convert the =Excel= file into a friendlier =csv= file:

#+BEGIN_SRC sh
unoconv -f csv S1_Materials.xlsx
#+END_SRC

#+RESULTS:
We then get the data with =R= =read.csv= function:

#+NAME: data-table
#+BEGIN_SRC R :session *R* :results silent
efficiency <- read.csv("S1_Materials.csv",header=TRUE,dec=",")
#+END_SRC
The description of this table is obtained with:

#+BEGIN_SRC sh :exports both :results output
wget http://files.figshare.com/1849395/S2_Materials.txt
cat "S2_Materials.txt"
#+END_SRC

* Making some figures
We can now make a figure out of the same data as figures 4, 5 and 6 of 
the paper but showing the actual data. We start with the "continuous 
text" exercise. We represent, in each of the four categories, each of 
the 10 individuals by a number between 0 and 9. Some horizontal jitter 
has been added to avoid overlaps. Category 1 corresponds to expert 
=Word= users; 2 to novice =Word= users; 3 to expert \LaTeX{} users; 4 to 
novice \LaTeX{} users:

#+HEADER: :file continuous.png :width 1000 :height 1000
#+BEGIN_SRC R :session *R* :exports both :results output graphics
layout(matrix(1:4,nc=2,byrow=TRUE))
par(cex=2)
plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
      xlab="User category",ylab="",main="Fraction of text")
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                PROZENT1[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
      plot(c(1,4),c(0,100),type="n",
           xlim=c(0.5,4.5),ylim=range(FEHLERSFT),xlab="User category",
           ylab="",main="Formatting errors and typos"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLERSFT[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
      plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
           ylim=range(FEHLEROFT),xlab="User category",ylab="",
           main="Orthographic and grammatical mistakes"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLEROFT[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=range(FEHLENDFT),
           xlab="User category",ylab="",main="Missing words and signs"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLENDFT[Kenntnisse==k],
                                pch = paste(0:9))))
#+END_SRC


Notice that the number of "missing words and signs" exactly mirrors the 
fraction of written text. We will see that this observation holds for 
the two following exercises. This "missing words and signs" is always 
roughly ten times as large as the two other sources of mistakes. This 
explains the inverse relationship between the blue and pink bars on each 
of the 3 figures.

Let's keep going with the "table exercise":

#+HEADER: :file table.png :width 1000 :height 1000
#+BEGIN_SRC R :session *R* :exports both :results output graphics
layout(matrix(1:4,nc=2,byrow=TRUE))
par(cex=2)
plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
      xlab="User category",ylab="",main="Fraction of text")
with(efficiency,sapply(1:4,
                        function(k) points(runif(10,k-0.2,k+0.2),
                                           PROZENT2[Kenntnisse==k],
                                           pch = paste(0:9))))

with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
                      ylim=range(FEHLERST),xlab="User category",
                      ylab="",main="Formatting errors and typos"))
with(efficiency,sapply(1:4,
                        function(k) points(runif(10,k-0.2,k+0.2),
                                           FEHLERST[Kenntnisse==k],
                                           pch = paste(0:9))))

with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
                      ylim=range(FEHLEROT),xlab="User category",
                      ylab="",main="Orthographic and grammatical mistakes"))
with(efficiency,sapply(1:4,
                        function(k) points(runif(10,k-0.2,k+0.2),
                                           FEHLEROT[Kenntnisse==k],
                                           pch = paste(0:9))))

with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
                      ylim=range(FEHLENDT),xlab="User category",ylab="",
                      main="Missing words and signs"))
with(efficiency,sapply(1:4,
                        function(k) points(runif(10,k-0.2,k+0.2),
                                           FEHLENDT[Kenntnisse==k],
                                           pch = paste(0:9))))
#+END_SRC

We also see a strange thing here: in each of the expert \LaTeX{} and the 
novice \LaTeX{} users we have one individual who did not right anything 
but still manage to produce 22 "formatting errors and typos" (!) but 
luckily no orthographic or grammatical error...

#+BEGIN_SRC R :session *R* :exports both
with(efficiency,cbind(c(PROZENT2[Kenntnisse==3][10],
                         FEHLERST[Kenntnisse==3][10],
                         FEHLEROT[Kenntnisse==3][10],
                         FEHLENDT[Kenntnisse==3][10]),
                       c(PROZENT2[Kenntnisse==4][7],
                         FEHLERST[Kenntnisse==4][7],
                         FEHLEROT[Kenntnisse==4][7],
                         FEHLENDT[Kenntnisse==4][7])))
#+END_SRC


Now for the "equations" exercise:

#+HEADER: :file equation.png :width 1000 :height 1000
#+BEGIN_SRC R :session *R* :exports both :results output graphics
layout(matrix(1:4,nc=2,byrow=TRUE))
par(cex=2)
plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
      xlab="User category",ylab="",main="Fraction of text")
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                PROZENT3[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
      plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
           ylim=range(FEHLERSFOR),xlab="User category",ylab="",
           main="Formatting errors and typos"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLERSFOR[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=range(FEHLEROFOR),
           xlab="User category",ylab="",
           main="Orthographic and grammatical mistakes"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLEROFOR[Kenntnisse==k],
                                pch = paste(0:9))))

with(efficiency,
      plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
           ylim=range(FEHLENDFOR),xlab="User category",ylab="",
           main="Missing words and signs"))
with(efficiency,
      sapply(1:4,
             function(k) points(runif(10,k-0.2,k+0.2),
                                FEHLENDFOR[Kenntnisse==k],
                                pch = paste(0:9))))
#+END_SRC



-- 
A Master Carpenter has many tools and is expert with most of them. If you only know how to use a hammer, every problem starts to look like a nail. Stay away from that trap.

Richard B Johnson.

--

Christophe Pouzat
MAP5 - Mathématiques Appliquées à Paris 5
CNRS UMR 8145
45, rue des Saints-Pères
75006 PARIS
France

tel: +33142863828
mobile: +33662941034
web: http://xtof.disque.math.cnrs.fr


[-- Attachment #2: EfficiencyComparison.pdf --]
[-- Type: application/pdf, Size: 218418 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27 11:36     ` Fabrice Popineau
@ 2014-12-28 22:43       ` Pascal Fleury
  0 siblings, 0 replies; 21+ messages in thread
From: Pascal Fleury @ 2014-12-28 22:43 UTC (permalink / raw)
  To: Fabrice Popineau; +Cc: M, Paul Rudin, emacs orgmode-mailinglist

[-- Attachment #1: Type: text/plain, Size: 4685 bytes --]

On Sat, Dec 27, 2014 at 12:36 PM, Fabrice Popineau <
fabrice.popineau@supelec.fr> wrote:

> I agree that this study is certainly not large enough to draw strong
> conclusions, but it raises a couple of questions
> and some points may require attention.
>
> I have spent many years in the TeX world. I see how lots of people use TeX
> : students, professionals, researchers etc...
> and I would easily draw 2 categories of people :
> - those who are programmers "in their soul" (DEK once said that 2% or so
> of the whole human race is gifted with programming, the same way some
> people are gifted to play music etc.)
> - those who use LaTeX "because it is the best typesetting system"
> People who belong to the intersection of those 2 categories will certainly
> be very efficient in producing documents with LaTeX, much more than what
> this study shows.
> But people from the first category may also be efficient in producing
> documents with Word (Word is programmable too and the typesetting engine is
> fancier than most people would believe).
>

That is funny, as I still face regularly Word typeset documents that do not
handle orphan lines properly, and have at least 2 fonts as "body text".
Easy to fix, but a non-issue in Latex.

As a researcher, handling references and cross-references is not something
that is "amortized" on a one-off paper, it's something that pays off over a
few documents. And in a publish-or-perish world, this does usually not take
long.

As a programmer, I like to be able to run one command (call it 'make' if
you wish...) that will run some analysis and recompute both the figures and
the document into a new version, possibly versionned.

And now you know why I use orgmode too...

--paf




> The real problem is the guys from the second category who stick to use a
> tool they are not comfortable with but they don't want to admit it.
> Over the last years, I have seen more and more students come with LaTeX
> documents which had a very poor appearance.
> There has been a lot of pressure with the rise of Linux to use LaTeX.
> Unfortunately the results of using LaTeX may not be up to the expectations.
> The tool is too complex. It can produce beautiful documents when used
> right, but it can also easily produce awful documents.
> You can also spend a lot of time in fixing details, and it happens more
> frequently than even proficient LaTeX users would admit.
> In the end, I think the tendency is to a growing number of LaTeX users who
> use it poorly.
>
> Finally, today, my experience is that publishers charge much more for
> LaTeX documents than for Word (or similar tools) documents and they are
> reluctant to use LaTeX because of its complexity.
>
> That was my $0.02
>
> Fabrice
>
> 2014-12-27 11:36 GMT+01:00 M <Elwood151@web.de>:
>
>> > Von: Paul Rudin <paul@rudin.co.uk>
>> > Datum: Sat, 27 Dec 2014 10:05:19 +0000
>> > An: <emacs-orgmode@gnu.org>
>> > Betreff: Re: [O] Efficiency of Org v. LaTeX v. Word
>> >
>> > Ken Mankoff <mankoff@gmail.com> writes:
>> >
>> >> People here might be interested in a publication from [2014-12-19 Fri]
>> >> available at http://dx.doi.org/10.1371/journal.pone.0115069
>> >>
>> >> Title: An Efficiency Comparison of Document Preparation Systems Used
>> >> in Academic Research and Development
>> >>
>> >> Summary: Word users are more efficient and have less errors than even
>> >> experienced LaTeX users.
>> >>
>> >> Someone here should repeat experiment and add Org into the mix, perhaps
>> >> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
>> >> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
>> >> pandoc) beat straight Word?
>> >>
>> >
>> > No mention of emacs... who uses anything else to prepare their LaTeX?
>> >
>> Did you forget the " ;-)" or are you serious?
>>
>> Emacs is for sure a very good one, but there are a lot of popular
>> alternatives, if you have a look at the (for sure not representative)
>> voting
>> on the answers of this discussion here:
>>
>> http://tex.stackexchange.com/questions/339/latex-editors-ides
>>
>> (It's clear, that people may have voted for several of those editors, so
>> that no valid statistics at all, but at least an idea...)
>>
>> Is there any real survey result about which editors LaTeX users use?
>>
>> Martin
>>
>>
>>
>>
>
>
> --
> Fabrice Popineau
> -----------------------------
> SUPELEC
> Département Informatique
> 3, rue Joliot Curie
> 91192 Gif/Yvette Cedex
> Tel direct : +33 (0) 169851950
> Standard : +33 (0) 169851212
> ------------------------------
>
>

[-- Attachment #2: Type: text/html, Size: 6655 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27  3:26 ` Christopher W. Ryan
@ 2014-12-28 22:45   ` Bob Newell
  0 siblings, 0 replies; 21+ messages in thread
From: Bob Newell @ 2014-12-28 22:45 UTC (permalink / raw)
  To: Org-mode mailing list


The "study" is an obvious diatribe couched in (poorly done) scientific
method. It almost seems like these researchers have at some time been
required to use LaTeX and are angry over it.

I will agree that LaTeX is slower and less efficient than LibreOffice (I
don't have Word on any of my computers but the argument I suppose is the
same), if your only interest is pounding out text of a first draft. But
the "study" didn't allow for the effects of proofreading, etc., and all
the things that are always done when producing something for
publication. I expect that the differences in grammar and orthographic
errors will be insignificant.

A 30-minute test is ridiculous. More meaningful would have been
end-to-end time to complete a given document. Maybe then Word would have
still been faster; I don't know. But that doesn't tell the whole story
by any means, including the very important matter of long-term storage
in a non-proprietary format.

-- 
Bob Newell
Honolulu, Hawai`i
* Sent via Ma Gnus 0.12-Emacs 24.3-Linux Mint 17 *

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA!
  2014-12-28 21:40 ` Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA! Christophe Pouzat
@ 2014-12-29 19:47   ` Thomas S. Dye
  2014-12-31 16:59   ` Colin Baxter
  1 sibling, 0 replies; 21+ messages in thread
From: Thomas S. Dye @ 2014-12-29 19:47 UTC (permalink / raw)
  To: Christophe Pouzat; +Cc: emacs-orgmode

Aloha Christophe,

I think you make a good case for the authors' poor choice of metrics.
These aren't well defined in the paper, so it is enlightening to see
your graphics and learn how their metrics were ineptly designed.

I hope you'll make your findings known to the PLOS audience.  This looks
to me like a clear case of peer review failure (see
http://www.sciencemag.org/content/342/6154/60.summary for an eye-opening
look at what's become of peer review.)

All the best,
Tom

Christophe Pouzat <christophe.pouzat@gmail.com> writes:

> Hi all,
>
> After seeing Ken's mail:
>
> Le 26/12/2014 23:47, Ken Mankoff a écrit :
>> People here might be interested in a publication from [2014-12-19 Fri]
>> available at http://dx.doi.org/10.1371/journal.pone.0115069
>>
>> Title: An Efficiency Comparison of Document Preparation Systems Used
>> in Academic Research and Development
>>
>> Summary: Word users are more efficient and have less errors than even
>> experienced LaTeX users.
>>
>> Someone here should repeat experiment and add Org into the mix, perhaps
>> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
>> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
>> pandoc) beat straight Word?
>>
>>    -k.
>>
>>
> and some of replies it triggered on the list, I went to check the paper. 
> As many of you guys I found some "results" puzzling in particular:
> 1. the use of bar graphs when the data would better be displayed 
> directly (that qualifies immediately the paper as "low quality" for me).
> 2. the larger error bars observed for LaTeX when compared to Word.
> 3. the systematic inverse relationship between the blue and pink bars 
> heights.
>
> So I went to figshare to download the data and looked at them. A quick 
> and dirty "analysis" is attached to this mail in PDF format (generated 
> with org, of course, and this awful software called LaTeX!) and the 
> source org file can be found at the bottom of this mail. I used R to do 
> the figures (and I'm sure the authors of the paper will then criticize 
> me for not using Excel with which everyone knows errors are generated 
> much more efficiently).
>
> I managed to understand the inverse relationship in point 3 above: the 
> authors considered 3 types of mistakes / errors:
> 1. Formatting and typos error.
> 2. Orthographic and grammatical errors.
> 3. Missing words and signs.
> Clearly, following the mail of Tom (Dye) on the list and on the Plos web 
> site, I would argue that formatting errors in LaTeX are bona fide bugs. 
> But the point I want to make is that the third source accounts for 80% 
> of the total errors (what's shown in pink bars in the paper) and clearly 
> the authors counted what the subjects did not have time to type as an 
> error of this type. Said differently, the blue and pink bars are showing 
> systematically the same thing by construction! The second type of error 
> in not a LaTeX issue (and in fact does not differ significantly from the 
> Word case) but an "environment" issue (what spelling corrector had the 
> LaTeX users access to?).
>
> There is another strange thing in the table copy case. For both the 
> expert and novice group in LaTeX, there is one among 10 subjects that 
> did produce 0% of the table but still manage to produce 22 typographic 
> errors!
>
> The overall worst performance of LaTeX users remains to be explained and 
> as mentioned in on the mails in the list, that does not make sense at 
> least for the continuous text exercise. The method section of the paper 
> is too vague but my guess is that some LaTeX users did attempt to 
> reproduce the exact layout of the text they had to copy, something LaTeX 
> is definitely not design to provide quickly.
>
> One more point: how many of you guys could specify their total number of 
> hours of experience with LaTeX (or any other software you are currently 
> using)? That what the subjects of this study had to specify...
>
> Let me know what you think,
>
> Christophe
>
> My org buffer:
>
> #+TITLE: An Efficiency Comparison of Document Preparation Systems Used 
> in Academic Research and Development: A Re-analysis.
> #+DATE: <2014-12-28 dim.>
> #+AUTHOR: Christophe Pouzat
> #+EMAIL: christophe.pouzat@gmail.com
> #+OPTIONS: ':nil *:t -:t ::t <:t H:3 \n:nil ^:t arch:headline
> #+OPTIONS: author:t c:nil creator:comment d:(not "LOGBOOK") date:t
> #+OPTIONS: e:t email:nil f:t inline:t num:t p:nil pri:nil stat:t
> #+OPTIONS: tags:t tasks:t tex:t timestamp:t toc:nil todo:t |:t
> #+CREATOR: Emacs 24.4.1 (Org mode 8.2.10)
> #+DESCRIPTION:
> #+EXCLUDE_TAGS: noexport
> #+KEYWORDS:
> #+LANGUAGE: en
> #+SELECT_TAGS: export
> #+LaTeX_HEADER: \usepackage{alltt}
> #+LaTeX_HEADER: \usepackage[usenames,dvipsnames]{xcolor}
> #+LaTeX_HEADER: \renewenvironment{verbatim}{\begin{alltt} \scriptsize 
> \color{Bittersweet} \vspace{0.2cm} }{\vspace{0.2cm} \end{alltt} 
> \normalsize \color{black}}
> #+LaTeX_HEADER: \definecolor{lightcolor}{gray}{.55}
> #+LaTeX_HEADER: \definecolor{shadecolor}{gray}{.85}
> #+LaTeX_HEADER: \usepackage{minted}
> #+LaTeX_HEADER: \hypersetup{colorlinks=true}
>
> #+NAME: org-latex-set-up
> #+BEGIN_SRC emacs-lisp :results silent :exports none
> (setq org-latex-listings 'minted)
> (setq org-latex-minted-options
>        '(("bgcolor" "shadecolor")
>      ("fontsize" "\\scriptsize")))
> (setq org-latex-pdf-process
>        '("pdflatex -shell-escape -interaction nonstopmode 
> -output-directory %o %f"
>      "biber %b"
>      "pdflatex -shell-escape -interaction nonstopmode -output-directory 
> %o %f"
>      "pdflatex -shell-escape -interaction nonstopmode -output-directory 
> %o %f"))
> #+END_SRC
>
> * Introduction
> This is a re-analysis of the data presented in 
> [[http://dx.doi.org/10.1371/journal.pone.0115069][An Efficiency 
> Comparison of Document Preparation Systems Used in Academic Research and 
> Development]]. My "interest" in this paper was triggered by a discussion 
> on the [[http://article.gmane.org/gmane.emacs.orgmode/93655][emacs org 
> mode mailing list]]. Ignoring the "message" of the paper, what stroke me 
> was the systematic use of bar graphs: a way of displaying data that 
> *should never be used* since when many observations are considered, a 
> box plot is going to do a much better job and when, like in the present 
> paper, few (10 in each of the 4 categories) observations are available, 
> a direct display or even a simple table is going to do a *much better* 
> job. Since it turns out that the data are available both on the Plos web 
> site and on 
> [[http://figshare.com/articles/_An_Efficiency_Comparison_of_Document_Preparation_Systems_Used_in_Academic_Research_and_Development_/1275631][figshare]], 
> I decided to re-analyze them.
>
> * Getting the data, etc.
>
> We get the data with:
>
> #+BEGIN_SRC sh
> wget http://files.figshare.com/1849394/S1_Materials.xlsx
> #+END_SRC
>
> #+RESULTS:
> Using for instance [[http://dag.wiee.rs/home-made/unoconv/][unoconv]], 
> we can convert the =Excel= file into a friendlier =csv= file:
>
> #+BEGIN_SRC sh
> unoconv -f csv S1_Materials.xlsx
> #+END_SRC
>
> #+RESULTS:
> We then get the data with =R= =read.csv= function:
>
> #+NAME: data-table
> #+BEGIN_SRC R :session *R* :results silent
> efficiency <- read.csv("S1_Materials.csv",header=TRUE,dec=",")
> #+END_SRC
> The description of this table is obtained with:
>
> #+BEGIN_SRC sh :exports both :results output
> wget http://files.figshare.com/1849395/S2_Materials.txt
> cat "S2_Materials.txt"
> #+END_SRC
>
> * Making some figures
> We can now make a figure out of the same data as figures 4, 5 and 6 of 
> the paper but showing the actual data. We start with the "continuous 
> text" exercise. We represent, in each of the four categories, each of 
> the 10 individuals by a number between 0 and 9. Some horizontal jitter 
> has been added to avoid overlaps. Category 1 corresponds to expert 
> =Word= users; 2 to novice =Word= users; 3 to expert \LaTeX{} users; 4 to 
> novice \LaTeX{} users:
>
> #+HEADER: :file continuous.png :width 1000 :height 1000
> #+BEGIN_SRC R :session *R* :exports both :results output graphics
> layout(matrix(1:4,nc=2,byrow=TRUE))
> par(cex=2)
> plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
>       xlab="User category",ylab="",main="Fraction of text")
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 PROZENT1[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
>       plot(c(1,4),c(0,100),type="n",
>            xlim=c(0.5,4.5),ylim=range(FEHLERSFT),xlab="User category",
>            ylab="",main="Formatting errors and typos"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLERSFT[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
>       plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>            ylim=range(FEHLEROFT),xlab="User category",ylab="",
>            main="Orthographic and grammatical mistakes"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLEROFT[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
> plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=range(FEHLENDFT),
>            xlab="User category",ylab="",main="Missing words and signs"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLENDFT[Kenntnisse==k],
>                                 pch = paste(0:9))))
> #+END_SRC
>
>
> Notice that the number of "missing words and signs" exactly mirrors the 
> fraction of written text. We will see that this observation holds for 
> the two following exercises. This "missing words and signs" is always 
> roughly ten times as large as the two other sources of mistakes. This 
> explains the inverse relationship between the blue and pink bars on each 
> of the 3 figures.
>
> Let's keep going with the "table exercise":
>
> #+HEADER: :file table.png :width 1000 :height 1000
> #+BEGIN_SRC R :session *R* :exports both :results output graphics
> layout(matrix(1:4,nc=2,byrow=TRUE))
> par(cex=2)
> plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
>       xlab="User category",ylab="",main="Fraction of text")
> with(efficiency,sapply(1:4,
>                         function(k) points(runif(10,k-0.2,k+0.2),
>                                            PROZENT2[Kenntnisse==k],
>                                            pch = paste(0:9))))
>
> with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>                       ylim=range(FEHLERST),xlab="User category",
>                       ylab="",main="Formatting errors and typos"))
> with(efficiency,sapply(1:4,
>                         function(k) points(runif(10,k-0.2,k+0.2),
>                                            FEHLERST[Kenntnisse==k],
>                                            pch = paste(0:9))))
>
> with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>                       ylim=range(FEHLEROT),xlab="User category",
>                       ylab="",main="Orthographic and grammatical mistakes"))
> with(efficiency,sapply(1:4,
>                         function(k) points(runif(10,k-0.2,k+0.2),
>                                            FEHLEROT[Kenntnisse==k],
>                                            pch = paste(0:9))))
>
> with(efficiency,plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>                       ylim=range(FEHLENDT),xlab="User category",ylab="",
>                       main="Missing words and signs"))
> with(efficiency,sapply(1:4,
>                         function(k) points(runif(10,k-0.2,k+0.2),
>                                            FEHLENDT[Kenntnisse==k],
>                                            pch = paste(0:9))))
> #+END_SRC
>
> We also see a strange thing here: in each of the expert \LaTeX{} and the 
> novice \LaTeX{} users we have one individual who did not right anything 
> but still manage to produce 22 "formatting errors and typos" (!) but 
> luckily no orthographic or grammatical error...
>
> #+BEGIN_SRC R :session *R* :exports both
> with(efficiency,cbind(c(PROZENT2[Kenntnisse==3][10],
>                          FEHLERST[Kenntnisse==3][10],
>                          FEHLEROT[Kenntnisse==3][10],
>                          FEHLENDT[Kenntnisse==3][10]),
>                        c(PROZENT2[Kenntnisse==4][7],
>                          FEHLERST[Kenntnisse==4][7],
>                          FEHLEROT[Kenntnisse==4][7],
>                          FEHLENDT[Kenntnisse==4][7])))
> #+END_SRC
>
>
> Now for the "equations" exercise:
>
> #+HEADER: :file equation.png :width 1000 :height 1000
> #+BEGIN_SRC R :session *R* :exports both :results output graphics
> layout(matrix(1:4,nc=2,byrow=TRUE))
> par(cex=2)
> plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=c(0,100),
>       xlab="User category",ylab="",main="Fraction of text")
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 PROZENT3[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
>       plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>            ylim=range(FEHLERSFOR),xlab="User category",ylab="",
>            main="Formatting errors and typos"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLERSFOR[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
> plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),ylim=range(FEHLEROFOR),
>            xlab="User category",ylab="",
>            main="Orthographic and grammatical mistakes"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLEROFOR[Kenntnisse==k],
>                                 pch = paste(0:9))))
>
> with(efficiency,
>       plot(c(1,4),c(0,100),type="n",xlim=c(0.5,4.5),
>            ylim=range(FEHLENDFOR),xlab="User category",ylab="",
>            main="Missing words and signs"))
> with(efficiency,
>       sapply(1:4,
>              function(k) points(runif(10,k-0.2,k+0.2),
>                                 FEHLENDFOR[Kenntnisse==k],
>                                 pch = paste(0:9))))
> #+END_SRC

-- 
Thomas S. Dye
http://www.tsdye.com

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA!
  2014-12-28 21:40 ` Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA! Christophe Pouzat
  2014-12-29 19:47   ` Thomas S. Dye
@ 2014-12-31 16:59   ` Colin Baxter
  1 sibling, 0 replies; 21+ messages in thread
From: Colin Baxter @ 2014-12-31 16:59 UTC (permalink / raw)
  To: emacs-orgmode


Dear Christophe,

Great work. You should submit it to http://www.plosone.org/ as a
response. It would be interesting to see what the Referees make of it.


Best wishes,

Colin.

> Hi all,
>
> After seeing Ken's mail:
>
> Le 26/12/2014 23:47, Ken Mankoff a écrit :
>> People here might be interested in a publication from [2014-12-19 Fri]
>> available at http://dx.doi.org/10.1371/journal.pone.0115069
>>
>> Title: An Efficiency Comparison of Document Preparation Systems Used
>> in Academic Research and Development
>>
>> Summary: Word users are more efficient and have less errors than even
>> experienced LaTeX users.
>>
>> Someone here should repeat experiment and add Org into the mix, perhaps
>> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
>> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
>> pandoc) beat straight Word?
>>
>>    -k.
>>
>>
> and some of replies it triggered on the list, I went to check the paper. 
> As many of you guys I found some "results" puzzling in particular:
> 1. the use of bar graphs when the data would better be displayed 
> directly (that qualifies immediately the paper as "low quality" for me).
> 2. the larger error bars observed for LaTeX when compared to Word.
> 3. the systematic inverse relationship between the blue and pink bars 
> heights.
>
> So I went to figshare to download the data and looked at them. A quick 
> and dirty "analysis" is attached to this mail in PDF format (generated 
> with org, of course, and this awful software called LaTeX!) and the 
> source org file can be found at the bottom of this mail. I used R to do 
> the figures (and I'm sure the authors of the paper will then criticize 
> me for not using Excel with which everyone knows errors are generated 
> much more efficiently).
>
> I managed to understand the inverse relationship in point 3 above: the 
> authors considered 3 types of mistakes / errors:
> 1. Formatting and typos error.
> 2. Orthographic and grammatical errors.
> 3. Missing words and signs.
> Clearly, following the mail of Tom (Dye) on the list and on the Plos web 
> site, I would argue that formatting errors in LaTeX are bona fide bugs. 
> But the point I want to make is that the third source accounts for 80% 
> of the total errors (what's shown in pink bars in the paper) and clearly 
> the authors counted what the subjects did not have time to type as an 
> error of this type. Said differently, the blue and pink bars are showing 
> systematically the same thing by construction! The second type of error 
> in not a LaTeX issue (and in fact does not differ significantly from the 
> Word case) but an "environment" issue (what spelling corrector had the 
> LaTeX users access to?).
>
> There is another strange thing in the table copy case. For both the 
> expert and novice group in LaTeX, there is one among 10 subjects that 
> did produce 0% of the table but still manage to produce 22 typographic 
> errors!
>
> The overall worst performance of LaTeX users remains to be explained and 
> as mentioned in on the mails in the list, that does not make sense at 
> least for the continuous text exercise. The method section of the paper 
> is too vague but my guess is that some LaTeX users did attempt to 
> reproduce the exact layout of the text they had to copy, something LaTeX 
> is definitely not design to provide quickly.
>
> One more point: how many of you guys could specify their total number of 
> hours of experience with LaTeX (or any other software you are currently 
> using)? That what the subjects of this study had to specify...
>
> Let me know what you think,
>
> Christophe
>
>
----- Snip -----

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-27 10:36   ` M
  2014-12-27 11:36     ` Fabrice Popineau
@ 2014-12-31 18:19     ` Paul Rudin
  1 sibling, 0 replies; 21+ messages in thread
From: Paul Rudin @ 2014-12-31 18:19 UTC (permalink / raw)
  To: emacs-orgmode-mXXj517/zsQ

M <Elwood151-S0/GAf8tV78@public.gmane.org> writes:

>> Von: Paul Rudin <paul-sqPYmOVXOov10XsdtD+oqA@public.gmane.org>
>> 
>> No mention of emacs... who uses anything else to prepare their LaTeX?
>> 
> Did you forget the " ;-)" or are you serious?

I wasn't being entirely serious; but I was alluding to a serious
point. You can't really compare a command line typesetting system alone
with a word processor. To make a proper comparison you'd have to look at
the complete toolchain.

For example, some of the errors are typos. Word, of course, has a speil
chucker. Did the LaTeX users use an editor that highlights such errors?

But as others have pointed out the more fundamental problem with the
study is that it tries to assess secretarial or copy-editing skills
rather than authoring skills.

(I haven't actually read the paper, just what has been said in this
thread.)

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
                   ` (6 preceding siblings ...)
  2014-12-28 21:40 ` Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA! Christophe Pouzat
@ 2015-01-04 20:38 ` John Kitchin
  2015-01-04 21:15   ` Andreas Leha
  7 siblings, 1 reply; 21+ messages in thread
From: John Kitchin @ 2015-01-04 20:38 UTC (permalink / raw)
  To: Ken Mankoff; +Cc: Org-mode mailing list

Ken Mankoff <mankoff@gmail.com> writes:

Did anyone see the parody of this here:
http://mjambon.github.io/vim-vs-emacs/

It is pretty funny!

> People here might be interested in a publication from [2014-12-19 Fri]
> available at http://dx.doi.org/10.1371/journal.pone.0115069
>
> Title: An Efficiency Comparison of Document Preparation Systems Used
> in Academic Research and Development
>
> Summary: Word users are more efficient and have less errors than even
> experienced LaTeX users.
>
> Someone here should repeat experiment and add Org into the mix, perhaps
> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
> pandoc) beat straight Word?
>
>   -k.
>
>
>

-- 
-----------------------------------
John Kitchin
Professor
Doherty Hall A207F
Department of Chemical Engineering
Carnegie Mellon University
Pittsburgh, PA 15213
412-268-7803
http://kitchingroup.cheme.cmu.edu

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: Efficiency of Org v. LaTeX v. Word
  2015-01-04 20:38 ` Efficiency of Org v. LaTeX v. Word John Kitchin
@ 2015-01-04 21:15   ` Andreas Leha
  0 siblings, 0 replies; 21+ messages in thread
From: Andreas Leha @ 2015-01-04 21:15 UTC (permalink / raw)
  To: emacs-orgmode

Hi John,

John Kitchin <johnrkitchin@gmail.com> writes:
> Ken Mankoff <mankoff@gmail.com> writes:
>
> Did anyone see the parody of this here:
> http://mjambon.github.io/vim-vs-emacs/
>
> It is pretty funny!

Indeed.  Thanks for sharing.

Regards,
Andreas


>
>> People here might be interested in a publication from [2014-12-19 Fri]
>> available at http://dx.doi.org/10.1371/journal.pone.0115069
>>
>> Title: An Efficiency Comparison of Document Preparation Systems Used
>> in Academic Research and Development
>>
>> Summary: Word users are more efficient and have less errors than even
>> experienced LaTeX users.
>>
>> Someone here should repeat experiment and add Org into the mix, perhaps
>> Org -> ODT and/or Org -> LaTeX and see if it helps or hurts. I assume
>> Org would trump LaTeX, but would Org -> ODT or Org -> X -> DOCX (via
>> pandoc) beat straight Word?
>>
>>   -k.
>>
>>
>>

^ permalink raw reply	[flat|nested] 21+ messages in thread

end of thread, other threads:[~2015-01-04 21:16 UTC | newest]

Thread overview: 21+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-12-26 22:47 Efficiency of Org v. LaTeX v. Word Ken Mankoff
2014-12-26 23:36 ` Thomas S. Dye
2014-12-27  2:21   ` briangpowell .
2014-12-27 14:36     ` Eric S Fraga
2014-12-27  3:26 ` Christopher W. Ryan
2014-12-28 22:45   ` Bob Newell
2014-12-27  4:27 ` Nick Dokos
2014-12-27  9:06   ` Peter Neilson
2014-12-27 14:38     ` Eric S Fraga
2014-12-27  9:48 ` Achim Gratz
2014-12-27 10:05 ` Paul Rudin
2014-12-27 10:36   ` M
2014-12-27 11:36     ` Fabrice Popineau
2014-12-28 22:43       ` Pascal Fleury
2014-12-31 18:19     ` Paul Rudin
2014-12-27 13:37 ` Daniele Pizzolli
2014-12-28 21:40 ` Efficiency of Org v. LaTeX v. Word ---LOOK AT THE DATA! Christophe Pouzat
2014-12-29 19:47   ` Thomas S. Dye
2014-12-31 16:59   ` Colin Baxter
2015-01-04 20:38 ` Efficiency of Org v. LaTeX v. Word John Kitchin
2015-01-04 21:15   ` Andreas Leha

Code repositories for project(s) associated with this external index

	https://git.savannah.gnu.org/cgit/emacs.git
	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.