Science wants to be free
In science we learn and do research based on the findings of, and conclusions drawn by, others before us. (“Standing on the shoulders of giants” ring a bell?) That is the very essence of cumulative science. To make the obligatory car analogy: What if the car manufacturers had to reinvent the wheel and the engine (steam first) every time they wanted to make a new model? That would most certainly make Thomas Kuhn cry, and we’d be driving ridiculously old-fashioned cars.
Yesterday my professor told me that he couldn’t actually provide us with access to various papers on the subject of the course he’s currently teaching, because certain publishers have hired students to spy on the teachers at the university and report back on any copyright infringements they notice. Usually the teacher would upload a PDF of the relevant article to our intranet, thus allowing students to download (and print) it, but now we’ll have to obtain these papers by other means, which may be any of the following:
- try to get hold of some obscure copy of some obscure journal containing the article in question
- pray that the publisher has made it available (either for free or relatively cheaply) on the web
- get in contact with someone who owns the relevant issue of the relevant journal containing the relevant article and hope they’ll let us xerox it (and still infringe on the copyright, but this time without the professor’s help)
… and that’s only the articles.
A lot of the texts we use in linguistics come from various collections, where a number of people contribute to the work published in book form. Now, we have some good libraries here, but they rarely have more than a single copy of a book (well, maybe some older editions), which means that if every student had to read an article from a particular book, they’d actually have to either borrow the book or read the article at the library. Think about it… 24 students all wanting to read the same article (which they’re not allowed to xerox) in the same book, of which the library only has one copy.
One could, of course, buy all the journals, collections, etc. But that would be so ridiculously expensive that studying something like linguistics would cost a fortune. Handing in a decent paper based on good research would quickly run into the thousands of euros. And we’re expected to write two or three of those papers each semester…
Now I’m thinking Science Bay (or maybe Science Nova): a marriage between Pirate Bay and Discogs containing nothing but torrents of scientific articles, neatly categorized, with detailed metadata, and searchable in every imaginable way. It would even be possible to provide BibTeX entries for every article, and even one huge, downloadable BibTeX database for the entire content of Science Bay. When uploading a torrent, one would be able to either fill out all the relevant metadata form fields for the article in question, or simply paste a BibTeX entry containing all the relevant/required metadata. It wouldn’t just be a solution to a problem, it’d be awesome! Searching for and actually finding relevant articles couldn’t be much easier! I’m not talking about sharing entire books or even just chapters of books. Just scientific articles from journals, collections, and what have you.
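To make the "paste a BibTeX entry" idea a bit more concrete, here is a rough sketch in Python of how a pasted entry could be turned into searchable fields. Everything in it is made up for illustration: the function names `parse_bibtex` and `search`, the sample entry, and the journal in it are all hypothetical. It also assumes only simple, flat entries with no nested braces; a real site would want a proper BibTeX parser.

```python
import re

# Hypothetical helpers for illustration only; not a full BibTeX parser.
# Assumption: fields look like `name = {value}` or `name = "value"`,
# with no nested braces inside the value.
ENTRY_RE = re.compile(r"@(\w+)\s*\{\s*([^,\s]+)\s*,")
FIELD_RE = re.compile(r'(\w+)\s*=\s*(?:\{([^{}]*)\}|"([^"]*)")')


def parse_bibtex(entry):
    """Turn one simple BibTeX entry into a dict of metadata fields."""
    head = ENTRY_RE.search(entry)
    if head is None:
        raise ValueError("this does not look like a BibTeX entry")
    meta = {"type": head.group(1).lower(), "key": head.group(2)}
    for name, braced, quoted in FIELD_RE.findall(entry[head.end():]):
        value = braced if braced else quoted
        # Collapse line breaks and repeated spaces inside a value.
        meta[name.lower()] = " ".join(value.split())
    return meta


def search(catalogue, **criteria):
    """Return entries whose fields contain every given substring (case-insensitive)."""
    return [
        meta for meta in catalogue
        if all(text.lower() in meta.get(field, "").lower()
               for field, text in criteria.items())
    ]


# A made-up entry, standing in for whatever an uploader would paste.
SAMPLE = """
@article{doe2009example,
  author  = {Doe, Jane},
  title   = {An Example Article on Syntax},
  journal = {Journal of Made-Up Linguistics},
  year    = {2009}
}
"""

meta = parse_bibtex(SAMPLE)
catalogue = [meta]
hits = search(catalogue, author="doe", year="2009")
```

The same dict could feed both the upload form (pre-filling the fields from the pasted entry) and the search, which is why pasting BibTeX and filling out the form by hand would end up as the same data.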
Only problem is, where would one get such a project hosted? We’ve all heard about the legal trouble Pirate Bay has faced over the years, and I certainly can’t afford to hire a lawyer to fend off pissed-off publishers. While I could probably easily do the code for the project, I don’t have access to decent servers, bandwidth, Hungary, legal advice, etc…
But isn’t this how science is really supposed to work? Everyone sharing what they find and contributing to a greater cause. I know some scientists like to live like rock stars (I’m looking at you, Hawking! :-p), but that’s no excuse for their publishers behaving like retarded record labels.