KDIR causal knowledge

less than 1 minute read

Automatic identification of quasi-experimental designs for discovering causal knowledge

by Jensen et. al

Introduction

  • blackbox approaches (statistical models, artificial intelligence, ...)
  • relational dependency network; visualizes the impact of factors (in a way similar to a spreading activation network)
  • difference: statistical associated <-> causal effect
  • people are interested in causal data
  • compare:
    • elections <-> campaign strategy
    • social science <-> social politcy, ..

  • problem:
    • elimination of common causes (3rd influence factor; Scheinkorrelation)
    • approaches toward eliminating the effect of common cause
      • control => hold 3rd variables
      • randomization -> compare: sample design (Fischer 1925)
      • Modeling: remove effects of potential common causes
      • quasi-experimental designs
    • emulates control by finding structures in the data¨-> set of conditions to tread the condition as if the data would come from an experiment
    • examples designs:
      • twin design - twins => eliminate the influence of genetics
      • two group pre-test post-test design
      • instrumental variable design
  • challenge: algorithms to (automatically) use the idea of quasi-experimental designs
    • requirements: temporal / spatial structure; relational structure
    • number of possible causal models is (3^(N^2))

    Method

    • use google-scholar queries to compare a topic's popularity
    Literature:

    • Jensen et al. (2008) Automatic identification of quasi-experimental designs for discovering causal knowledge (KDD 2008)
    • work on causal interpretation of Bayes networks