Collective Intelligence

Albert Weichselbraun

is a Professor of Information Science at the University of Applied Sciences of the Grisons.

Collective Intelligence

by Toby Segaran

An excellent guide to programming Web 2.0 applications, with code examples and clear explanations of the techniques used.

Similarity Metrics

Euclidean distance
Pearson correlation coefficient (corrects for grade inflation, i.e. users who consistently give higher or lower ratings than others)
Tanimoto coefficient $$\frac{|A \cap B|}{|A \cup B|}$$
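The three metrics can be sketched in a few lines of Python. This is a minimal illustration rather than the book's code; the function names and the sample ratings are assumptions made for this example.

```python
from math import sqrt

def euclidean_distance(a, b):
    """Straight-line distance between two equal-length rating vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def pearson(a, b):
    """Pearson correlation; returns 0.0 if either vector has no variance."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)

def tanimoto(a, b):
    """Size of the intersection divided by the size of the union of two sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Hypothetical ratings: same preferences, but one user rates one point higher.
strict = [1, 2, 3, 4]
generous = [2, 3, 4, 5]

print(euclidean_distance(strict, generous))        # 2.0
print(pearson(strict, generous))                   # 1.0
print(tanimoto({"a", "b", "c"}, {"b", "c", "d"}))  # 0.5
```

The two users are two units apart by Euclidean distance, yet their Pearson correlation is perfect, which is the grade-inflation correction in action.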

Clustering

The author applies Fick's law to clustering: only terms that occur in more than 10% and fewer than 50% of the documents are clustered.
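The document-frequency filter can be sketched as follows. The function name `select_terms`, its parameters, and the toy corpus are assumptions for illustration. Because the corpus has only five documents, the example raises the lower bound to 0.2 so that both ends of the filter take effect.

```python
def select_terms(documents, low=0.1, high=0.5):
    """Return terms whose document frequency lies strictly between low and high."""
    doc_count = len(documents)
    doc_freq = {}
    for words in documents:
        # Count each term once per document, however often it repeats.
        for term in set(words):
            doc_freq[term] = doc_freq.get(term, 0) + 1
    return sorted(t for t, c in doc_freq.items() if low < c / doc_count < high)

# Hypothetical corpus of tokenized documents.
docs = [
    ["the", "python", "code"],
    ["the", "python", "web"],
    ["the", "search", "engine"],
    ["the", "search", "rank"],
    ["the", "cluster", "rare"],
]

print(select_terms(docs, low=0.2, high=0.5))  # ['python', 'search']
```

"the" appears in every document and is dropped as too common; terms that appear in a single document are dropped as too rare.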

hierarchical clustering
k-means
multidimensional scaling (the distance between terms in the resulting plot is proportional to how dissimilar they are)
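As a rough illustration of k-means, the sketch below alternates between assigning points to the nearest centroid and moving each centroid to the mean of its members. It makes two simplifying assumptions: it uses squared Euclidean distance, and the caller supplies the initial centroids so that the result is deterministic. Implementations commonly pick the initial centroids at random. The function name and sample points are hypothetical.

```python
def kmeans(points, centroids, iterations=10):
    """Minimal k-means on tuples of numbers with caller-supplied initial centroids."""
    centroids = [tuple(c) for c in centroids]
    assignment = []
    for _ in range(iterations):
        # Assignment step: index of the nearest centroid for every point.
        new_assignment = [
            min(
                range(len(centroids)),
                key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])),
            )
            for point in points
        ]
        if new_assignment == assignment:
            break  # converged: no point changed its cluster
        assignment = new_assignment
        # Update step: move each centroid to the mean of its members.
        for i in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == i]
            if members:
                centroids[i] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignment, centroids

points = [(1, 1), (1, 2), (8, 8), (9, 8)]
assignment, centroids = kmeans(points, [(0, 0), (10, 10)])
print(assignment)  # [0, 0, 1, 1]
print(centroids)   # [(1.0, 1.5), (8.5, 8.0)]
```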

Search engines

The book presents weighting techniques for search engines, including:

Number of occurrences
Document location (early words have higher weights)
Word distance (for multiple terms)
PageRank
Link text (higher weights for terms occurring in links)
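To show how such weights might be combined, the sketch below scores documents for a single query term using two of the signals listed above: number of occurrences and document location. Each signal is normalized so that the best document gets 1.0, and the weighted sum gives the final score. The function name `rank`, the weight parameters, and the sample documents are assumptions for illustration; word distance, PageRank, and link text are left out.

```python
def rank(documents, query, freq_weight=1.0, location_weight=1.0):
    """Rank documents containing `query` by normalized frequency and location."""
    freq, location = {}, {}
    for doc_id, words in documents.items():
        if query not in words:
            continue
        freq[doc_id] = words.count(query)
        # 1-based position of the first occurrence; earlier is better.
        location[doc_id] = words.index(query) + 1
    if not freq:
        return []
    max_freq = max(freq.values())
    min_location = min(location.values())
    scores = {
        doc_id: freq_weight * freq[doc_id] / max_freq
        + location_weight * min_location / location[doc_id]
        for doc_id in freq
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical mini index of tokenized pages.
pages = {
    "a": "python search engine in python".split(),
    "b": "a guide to python".split(),
    "c": "web clustering".split(),
}

print(rank(pages, "python"))  # [('a', 2.0), ('b', 0.75)]
```

Page "a" wins on both signals: it mentions the term twice and does so in its first word, while page "c" does not contain the term and is excluded.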
