Open Mind Common Sense

Wednesday, April 22, 2009

New site.

We've got a new version of the Open Mind Common Sense site: openmind.media.mit.edu

The big changes:

It's based on the Pinax web framework. This should make it easier to add features to the site.
It's running on ConceptNet 3.5 instead of 3.0. (So was the old site, kinda, but it was a hack that wasn't sustainable.)
It distinguishes between "assertions", the normalized connections between concepts that Open Mind learns from, and "statements", the roughly parsed text that people have typed in. You can vote on both of them. This is a key step toward putting back the free text box.

Thursday, February 26, 2009

Also in the realm of new and exciting stuff is our announcement mailing list. It's we'll use it for any big news we have and to announce workshops, symposiums, and software releases.

Subscribe yourself here!

Launchpad and Bazaar

We're on Launchpad now. We can host our version control there, track bugs, and answer questions from users.

For people who work on Open Mind within the Media Lab (and possibly even others), here's a guide to hacking on the code using Bazaar.

Thursday, February 19, 2009

IUI

Hi everyone!

We've been quite busy lately and I'm going to make a point to update this more often. I really intend to, and since all I have to do this semester is graduate and transition to being a post-doc on top of Open Mind I should have lots of free time. What have we been up to? Well...

Most recently, Henry, Erik Mueller , and I ran a workshop on story understanding at IUI last week. Rodger Schank gave an interesting keynote on designing user interfaces using stories. He focused on how people interact and convey information naturally using stories rather than using the constructs which are common in user interfaces today. He talked about how "cavemen" communicated (ie, what are the modes of interaction we've been using all along), just-in-time information and making interfaces more goal-directed. We had a lot of good discussions, saw papers presented, had a demo session and many of us are still communicating by email. I'll try to post a more comprehensive summary later.

Also at IUI, Jayant and I gave a main conference talk on a paper by pretty much all of us using mixture models to play a game of twenty questions with the user. It asks questions which are selected to help AnalogySpace infer information about a new concept.

There's lots of stuff in the pipeline:

There are thoughts of an AAAI symposium in a year for the entire common sense community.
I'm working on a new technique for infusing normal data and reasoning techniques with common sense. It's working really well.
We're planning on putting up a lot of documentation soon and possibly some videos. Keep watching.

Monday, August 11, 2008

Assertions, sentences, and ratings

Ken mentioned that we're in the middle of reorganizing the database. I'll fill in some more details about what we're doing.

Currently, our users give their ratings to assertions, the things that make up the links of ConceptNet. Many sentences can yield the same assertion: for example, "dogs are mammals" and "a dog is a kind of mammal" both turn into an assertion that can be expressed as IsA(dog, mammal). The ratings on these assertions are useful to representations such as AnalogySpace.

The problem is that the OMCS web site doesn't want to show you IsA(dog, mammal), it wants to show you something in natural language. And some of the natural language we've collected is of

What should matter to OMCS isn't just how good the abstracted assertions are, it's how good the sentences are.

So we're reorganizing the database. After this, your ratings will apply to the sentences you see, and the scores on assertions will come from aggregating those ratings. We'll display each assertion using its highest-rated sentence. This puts our users in charge of which sentences show up, instead of arbitrary decisions by the computer.

The hard part of the reorganization is that we have to take all of the existing ratings and find out where they came from. If they came from a user on the new site, for example, we need to know what sentence they were looking at when they gave the rating. The database generally has this information, but not necessarily recorded in a smart way.

So what we've really been doing is cleaning up messes in the database while we track down where the ratings should go. And most of which were created by me a couple of days before giving a demo. Sorry about that.

Facebook

Hi everyone, I'm Catherine.

I will post some more substantive soon, but first-off I'd like to say we have a facebook group now. I'll hopefully update it with pictures and such as things become available.

In other news Rob, Henry, and I went to AAAI in Chicago to present AnalogySpace which went rather well. I had a great conference and really enjoyed the city of Chicago.

Friday, August 8, 2008

A Brief Note on Status

First, thanks to everybody who has been contributing! We're running all sorts of cool analysis stuff with our data, and most everything you put it makes the analysis a bit better. And that's just the beginning...

We've gotten some feedback recently that basically suggests that our interaction with our user community has been lacking. That has definitely ratcheted up in our priorities, and we have a few things in the works. But for the moment, I thought I'd hit on a few items that people have asked about recently.

Usability: Obviously the site has usability flaws -- some larger than others. Specific feedback is most helpful. We've been doing a lot of grimy back-end work, but with your help we won't neglect the front end.
Speed: will improve a lot when we move to a server that doesn't have a game port in the back (really). There are certainly some optimizations to do also, but we're prioritizing increased functionality. Stay tuned...
Acquiring common sense from elsewhere. Though we haven't shared any of the work yet, we've actually done a lot of work in combining our dataset with other data to get interesting new results. Soon we'll be leveraging those tools to pull in large amounts of (hopefully useful) information from several user-contributed large databases of knowledge. One is, yes you guessed it, Wikipedia.
The "fix" flag is just a temporary flag until we implement the UI to edit stuff. It's not quite as simple as it seems because of the interaction with ratings, etc. We're reorganizing the database right now so that such things will become possible.
Stats are actually available on http://commons.media.mit.edu/en/stats/ which we haven't promoted yet because it's not complete or well-explained. We do have the raw data to do graphs and other spiffy stuff, but just haven't gotten around to it. Suggestions for neat graphing libraries, or just straight-up code contributions, are welcome.
Community involvement is very important to us, though we haven't been showing it yet. We're trying to run this as an open-source project. We haven't officially released the main website code, but we can send a tarball on request, and we're considering ways of doing better. If you can code, you'll be welcome to help out, or help recruit others. The site is written in Python, using Django and the ConceptNet and Divisi libraries that are already available (see the links on the home page).

If there are any other things you'd suggest we do to improve interaction with the user community, please share your views in the comments.

Also, feel free to ask anything -- about the site, the project, us, etc.; we'll try to respond.

-Ken