Recent posts

Text-based Information Retrieval (T-IR)

1 minute read

a Workshop organized by Benno Stein. Content Extraction (from Web pages) Filter lines based on the content to tags ratio.</p> a <- ASCII character...

Utilification

less than 1 minute read

by John Wilkes, Jeffrey Mogul, and Jaap Suermondt This paper elaborates on utilitfication - the transfer of applications to utility computing. Utility comput...

Designing a Better Shopbot

2 minute read

This paper describes optimizing the design of a shopbot (=shopping robot) which considers, the intrinsic value of the product, the disutility from waiting, ...

Collective Intelligence

less than 1 minute read

by Segaran Toby An excellent guide to programming Web 2.0 applications, with code examples and excellent explanations of the used techniques. Similarity Metr...

Information Diffusion

less than 1 minute read

by Dimitry Zibold The following article summarizes some interesting aspects from Dimitry's research: A Shingle is a contiguous sub-sequence of tokens in a d...

Getting Better Search Results

less than 1 minute read

Human-aided filtering can make the difference (by Bob Zeidman). Bob presents in this article how human-aided filtering can improve filtering accuracy. At fir...