Archive for 2006

Choosing Information Resources

Monday, February 20th, 2006

According to this paper the most influential criteria engineers and scientists use for selecting information resources are not quality or even familiarity but 1. the time it takes to track down information and 2. the authoritativeness of the resource. Perhaps this explains why researchers are increasingly using Google Scholar rather than PubMed (PubMed is more authoritative, but Google’s ranked results allow you to find publications faster) and why the NCBI still receives more requests for protein-related data than we do (both sites are equally bad at searching, but NCBI may be seen as more authoritative).

Biologists and Computer Scientists

Wednesday, February 15th, 2006

Much has been said about (and blamed upon) communication problems between biologists and computer scientists. But after attending a meeting with biologists and computer scientists this week, I am starting to suspect that the problem isn’t just communication.

How to Encourage Contributions

Wednesday, February 8th, 2006

We are currently exploring various strategies to encourage people to let us know when they find errors or omissions in UniProt, or even to contribute data as they publish their research, rather than waiting for a curator to pick up their results from a publication.

UniProt & Creative Commons

Tuesday, February 7th, 2006

After having meetings and discussions that dragged on for more than an year, we are now finally distributing our data under a Creative Commons license.

Data and Reality

Monday, February 6th, 2006

Brief review of Data and Reality by William Kent. This book was written in 1978, but is still remarkably relevant in many ways.

ToonTalk

Tuesday, January 31st, 2006

It looks like our initiative to port all of our code to a more compact and efficient language didn’t get very far. So what next?

Call for Better Information Retrieval Systems

Friday, January 27th, 2006

From a recent review article in Nature Genetics:

[...] current ad hoc IR systems are not able to retrieve our example sentence when they are given the query ‘yeast cell cycle’. Instead, this could be achieved by realizing that ‘yeast’ is a synonym for S. cerevisiae, that ‘cell cycle’ is a Gene Ontology term, that the word ‘Cdc28′ refers to an S. cerevisiae protein and finally, by looking up the Gene Ontology terms that relate to Cdc28 to connect it to the yeast cell cycle. Although this will not be easy, we see this form of query expansion as the next logical step for ad hoc IR.

Emotional Design

Monday, January 23rd, 2006

Brief review of Emotional Design. Donald Norman uses this book to expand a bit on the well-known The Design of Everyday Things. In particular, the new book discusses some of the non-rational aspects of design that the previous book ignores.

Information Visualization

Monday, January 23rd, 2006

Brief review of Information Visualization by Colin Ware. This is a textbook with a great amount of detail on the physics, biology and psychology behind visual perception.

Fast Judgement

Monday, January 23rd, 2006

Nature reports about a study showing that users may be deciding on whether or not they like a site within a fraction of a second.