Datamining Pages:
feed me
Datamining - Recent Bookmarks - Page 1:
Data Brewery and Cubes
Brewery is a Python framework and collection of tools for analysing and mining data. Goal is to provide functions and tools for: streaming and processing structured data from various sources, suc... Read more
http://databrewery.org/
Tags: python, datamining, olap, programming, data, analytics, development, tools, opensource, database Saved by: admin at 09 Jun 2011
Brewery is a Python framework and collection of tools for analysing and mining data. Goal is to provide functions and tools for: streaming and processing structured data from various sources, suc... Read more
http://databrewery.org/
Tags: python, datamining, olap, programming, data, analytics, development, tools, opensource, database Saved by: admin at 09 Jun 2011
COS 493, Spring 2002: Schedule and Readings
The course deals with algorithmic tools and techniques for the organization, manipulation and processing of large amounts of data. With the current information explosion, several applications require ... Read more
http://www.cs.princeton.edu/courses/archive/spring02/cs493/schedule.html
Tags: algorithms, data, algorithm, datamining, programming, cs, machinelearning, lecture, ai, statistics Saved by: admin at 06 Jun 2011
The course deals with algorithmic tools and techniques for the organization, manipulation and processing of large amounts of data. With the current information explosion, several applications require ... Read more
http://www.cs.princeton.edu/courses/archive/spring02/cs493/schedule.html
Tags: algorithms, data, algorithm, datamining, programming, cs, machinelearning, lecture, ai, statistics Saved by: admin at 06 Jun 2011
Data Mining Map
Excelente libro online de DataMining La minería de datos trata de explicar el pasado y predecir el futuro por medio del análisis de datos. Este es un campo multidisciplinario que combina la estadí... Read more
http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm
Tags: datamining, reference, tutorial, data, statistics, data-mining, programming, mining, book, visualization Saved by: admin at 23 May 2011
Excelente libro online de DataMining La minería de datos trata de explicar el pasado y predecir el futuro por medio del análisis de datos. Este es un campo multidisciplinario que combina la estadí... Read more
http://chem-eng.utoronto.ca/~datamining/dmc/data_mining_map.htm
Tags: datamining, reference, tutorial, data, statistics, data-mining, programming, mining, book, visualization Saved by: admin at 23 May 2011
Matt Croydon::Postneo » Blog Archive » Social Graph Analysis using Elastic MapReduce and PyPy
a couple of papers (Who Says What to Whom on Twitter and What is Twitter, a Social Network or a News Media?) that cited data collected by researchers for the latter paper. This 5 gigabyte compressed ... Read more
http://postneo.com/2011/05/04/social-graph-analysis-using-elastic-mapreduce-and-pypy
Tags: python, mapreduce, hadoop, social, aws, datamining, ec2, analytics, howto, map Saved by: admin at 05 May 2011
a couple of papers (Who Says What to Whom on Twitter and What is Twitter, a Social Network or a News Media?) that cited data collected by researchers for the latter paper. This 5 gigabyte compressed ... Read more
http://postneo.com/2011/05/04/social-graph-analysis-using-elastic-mapreduce-and-pypy
Tags: python, mapreduce, hadoop, social, aws, datamining, ec2, analytics, howto, map Saved by: admin at 05 May 2011
The Secrets of Building Realtime Big Data Systems
The Secrets of Building Realtime Big Data Systems (slides) http://slidesha.re/hkLvJN – Nathan Marz (nathanmarz) http://twitter.com/nathanmarz/status/51350038194040832
http://www.slideshare.net/nathanmarz/the-secrets-of-building-realtime-big-data-systems
Tags: data, scalability, programming, bigdata, development, architecture, realtime, slideshow, datamining, scaling Saved by: admin at 27 Mar 2011
The Secrets of Building Realtime Big Data Systems (slides) http://slidesha.re/hkLvJN – Nathan Marz (nathanmarz) http://twitter.com/nathanmarz/status/51350038194040832
http://www.slideshare.net/nathanmarz/the-secrets-of-building-realtime-big-data-systems
Tags: data, scalability, programming, bigdata, development, architecture, realtime, slideshow, datamining, scaling Saved by: admin at 27 Mar 2011
Data Science Toolkit
A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and Javascript interfaces. Available as a self-contain... Read more
http://www.datasciencetoolkit.org/
Tags: data, tools, opensource, datamining, python, text, tool, science, api, geocoding Saved by: admin at 25 Mar 2011
A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and Javascript interfaces. Available as a self-contain... Read more
http://www.datasciencetoolkit.org/
Tags: data, tools, opensource, datamining, python, text, tool, science, api, geocoding Saved by: admin at 25 Mar 2011
Waffles
A collection of command-line tools for researchers in machine learning, data mining, and related fields. All of the functionality is also provided in a clean C++ class library. Demo apps are included ... Read more
http://waffles.sourceforge.net/
Tags: machinelearning, ai, c++, opensource, library, datamining, programming, research, software, machine-learning Saved by: admin at 22 Mar 2011
A collection of command-line tools for researchers in machine learning, data mining, and related fields. All of the functionality is also provided in a clean C++ class library. Demo apps are included ... Read more
http://waffles.sourceforge.net/
Tags: machinelearning, ai, c++, opensource, library, datamining, programming, research, software, machine-learning Saved by: admin at 22 Mar 2011
Overview: Extracting article text from HTML documents | My tech blog.
"In the world of web scraping, text mining and article reading utilities (readability bookmarklet) there is an ever growing demand for utilities that are capable of distinguishing parts of a HTML docu... Read more
http://tomazkovacic.com/blog/14/extracting-article-text-from-html-documents/
Tags: html, text, datamining, scraping, extraction, algorithms, nlp, research, content, web Saved by: admin at 20 Mar 2011
"In the world of web scraping, text mining and article reading utilities (readability bookmarklet) there is an ever growing demand for utilities that are capable of distinguishing parts of a HTML docu... Read more
http://tomazkovacic.com/blog/14/extracting-article-text-from-html-documents/
Tags: html, text, datamining, scraping, extraction, algorithms, nlp, research, content, web Saved by: admin at 20 Mar 2011
Pattern | CLiPS
Pattern is a web mining module for the Python programming language from University of Antwerp computational linguistics It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spid... Read more
http://www.clips.ua.ac.be/pages/pattern
Tags: python, datamining, nlp, programming, software, library, web, data, language, mining Saved by: admin at 25 Feb 2011
Pattern is a web mining module for the Python programming language from University of Antwerp computational linguistics It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spid... Read more
http://www.clips.ua.ac.be/pages/pattern
Tags: python, datamining, nlp, programming, software, library, web, data, language, mining Saved by: admin at 25 Feb 2011
Wrangler
Wrangler is an interactive tool for data cleaning and transformation. Spend less time formatting and more time analyzing your data. Why wrangle? * Too much time is spent manipulating data just to... Read more
http://vis.stanford.edu/wrangler/
Tags: data, tools, analysis, datamining, visualization, opensource, analytics, cleaning, stanford, tool Saved by: admin at 05 Feb 2011
Wrangler is an interactive tool for data cleaning and transformation. Spend less time formatting and more time analyzing your data. Why wrangle? * Too much time is spent manipulating data just to... Read more
http://vis.stanford.edu/wrangler/
Tags: data, tools, analysis, datamining, visualization, opensource, analytics, cleaning, stanford, tool Saved by: admin at 05 Feb 2011