UDR is now being using by the UC Santa Cruz Gene Browser.
Boot strap labeling using ensembles of classifers trained on small subsets of data.
A perspective on some of the challenges analyzing remote and distribute data.
I'm a data scientist who has been working with big data since 1988. This site contains some of my technical articles, talks, and blogs posts. I'm a faculty member and the Director of the Center for Data Intensive Science at at the University of Chicago. I'm also the Chief Data Scientist of Open Data Group, which builds predictive models over big data.