Ken Novak's Weblog
Purpose of this blog: to retain annotated bookmarks for my future reference, and to offer others my filter technology and other news. Note that this blog is categorized. Use the category links to find items that match your interests.
Subscribe to get this blog by e-mail.
New: Read what I'm reading on Bloglines.

Collection of raw notes on Bayes implementations

Future Now: Bayesian Nets: "A very good tutorial on Bayesian Nets, with lots of supporting information. Via the package Netica from Norsys. This methodology is becoming more common for delivering expertise. Because its statistically based it can model aspects of uncertainty in a system. The site has downloadable software for testing. This can be seen as a replacement for the 'rule bases' that we used for delivering expertise back in the 90s in expert systems. Here is another useful online tutorial."

below mostly collected from Slashdot | Bayesian Filtering Outside of Email?

bug/suggestion tracking (Score:1)
by yardbird (165009) * on Tuesday March 30, @03:08PM (#8717148)
When the original "plan for spam" article came out, I got excited about it and incorporated it into a suggestion tracking system I was working on. The end result was nice. In the system, the user would look at email and associate it with existing suggestions or bug reports. The system learned what words were associated with which suggestions or bugs, and would show the user a list of suggestions which might be relevant for the email he was viewing. It worked surprisingly well.
Re:bug/suggestion tracking (Score:2)
by AeiwiMaster (20560) on Sunday April 04, @09:44AM (#8761177)
I would like to learn more about this.

Do you have a link or other info ??
Re:bug/suggestion tracking (Score:1)
by yardbird (165009) * on Tuesday April 06, @11:42PM (#8788667)
Sad to say, it was a work-for-hire so I don't have rights to the source. If you have any general questions about it, feel free to contact me: asthma_pie at earthlink dot net.

Statistical background:

Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval - Lewis (ResearchIndex): "The naive Bayes classifier, currently experiencing a renaissance in machine learning, has long been a core technique in information retrieval. We review some of the variations of naive Bayes models used for text retrieval and classification, focusing on the distributional assumptions made about word occurrences in documents"

Mature scientific applications:

MrBayes: "MrBayes is a program for the Bayesian estimation of phylogeny. [The evolutionary relationships among organisms; the patterns of lineage branching].  Bayesian inference of phylogeny is based upon a quantity called the posterior probability distribution of trees, which is the probability of a tree conditioned on the observations. The conditioning is accomplished using Bayes's theorem. The posterior probability distribution of trees is impossible to calculate analytically; instead, MrBayes uses a simulation technique called Markov chain Monte Carlo (or MCMC) to approximate the posterior probabilities of trees."

BAMBE: "BAMBE Bayesian Analysis in Molecular Biology and Evolution"

The BUGS Project: "BUGS is a program that carries out Bayesian inference on complex statistical problems for which there is no exact analytic solution, and for which even standard approximation techniques have difficulties. Conditional independence assumptions mean that it is often convenient to represent the essential structure of the problem as a graphical model. A Markov chain Monte Carlo (MCMC) approach to numerical integration is used: "

Public Bloglines
Technorati Profile
news search
blog search
Last update: 11/25/2005; 12:18:18 AM.
0 page reads.