June 30, 2008

Pluribo: Instant Summaries of Amazon Reviews

Filed under: Misc — Tags: — jeetu @ 9:10 pm

Posted at ReadWriteWeb

by Frederic Lardinois

pluribo-logo.png

Pluribo is a Firefox plugin that displays short summaries of product reviews on Amazon.com. Pluribo scans through reviews customers on Amazon have left and automatically creates a one sentence summary that is somewhat akin to a Zagat review. While Zagat uses human editors to compile its reviews, though, Pluribo is fully automated. Right now, Pluribo only works for the electronics section of Amazon’s store, but the developers are planning to expand this to the rest of Amazon’s offerings soon.

Here is a typical summary that Pluribo created for a SanDisk MP3 player: “This has been on the market for a while. Although there were objections to the software, users are happy with the low price, product support, and battery. If you don’t care about the software, it has potential.”

Besides summarizing the reviews for a given product, Pluribo also compares those reviews to other products in the same category to see where the reviews for this item were different. Pluribo will also give more weight to reviews that were considered ‘helpful’ by other Amazon customers. Pluribo keeps most of this information in the background. However, when hovering over a keyword in the summary, a small pop-up will display a list of relevant phrases Amazon’s customers used to describe the item, as well as some more of the statistical data Pluribo used to compute its summary.

In testing out Pluribo, it consistently displayed accurate summaries of the actual user comments – a testament to how well the developers have tuned their algorithm to at least this limited range of product categories.

Pluribo’s overall execution is quite seamless and Pluribo does not slow down the load times on Amazon, as it only gets to work after the page is fully displayed.

pluribo-compute.png

However, Pluribo seems quite restricted when it comes to what items it will display reviews for and for which it will just display a ‘coming soon’ message. Right now, it only works well for MP3 players, GPS navigation systems, and digital cameras. It also seems to work best for products that have been reviewed at least 30 times.

It would be nice to see Pluribo start pulling in reviews from other sources besides Amazon’s own customers. The fact that it only works as a Firefox plugin is also going to limit its appeal to technically savvy users for the time being.

Overall, Pluribo is a fun and (when it’s working) useful plugin – though for the time being, its a bit too limited to be of real help. It’s real potential is only going to be realized once the developers get out a version that works across all of Amazon’s offerings and maybe even expands beyond Amazon to include other online stores.




Choice of toppers: IIT-Bombay

Filed under: Misc — jeetu @ 8:57 pm
Choice of toppers: IIT-Bombay
1 Jul 2008, 0220 hrs IST, Hemali Chhapia,TNN
MUMBAI: The cream of the country’syoung brains continues to hanker after IIT-Bombay. The institution has retainedits position as the most sought-after IIT in the country, with Delhi and Madrasa distant second and third respectively.

A number of factors havebeen responsible for this: from geography to gastronomy and placement records towhat coaching classes tell students.

Of the top 100 JEE-2008 rankerswho have been admitted to the IITs this year, more than 50% preferred IIT-B overany other IIT (see box). This was followed by Delhi — where 27 of the top100 — have been admitted. While Bombay and Delhi have maintained theirpositions over the years, IIT-Madras has overtaken Kanpur this year.

Collaborative Filtering: Lifeblood of The Social Web

Filed under: Misc — Tags: — jeetu @ 3:48 pm

Posted at ReadWriteWeb

by Muhammad Saleem

Collaborative Filtering (Wikipedia definition) is a mechanism used to filter large amounts of information by spreading the process of filtering among a large group of people. Unlike mainstream media where there is either one or very few editors setting guidelines, the collaboratively filtered social web can have infinitely many editors and gets better as you increase the number of participants.

There are two basic principles involved in Collaborative Filtering.

1. The Wisdom of Crowds and Law of Large Numbers suggest that as communities grow, not only does a large (diverse, independent, etc.) community make better decisions than a handful of editors, but the larger a community gets, the better its decisions will be. Therefore, we can hypothetically create a Collaboratively Filtered newspaper, television channel, radio station, etc., which would be better (for the community) rather than any other arbitrarily selected medium. In fact, as we will see, services like Digg, YouTube, and Last.fm, are trying to do exactly that – (CF) based media outlets.

2. The second principle of Collaborative Filtering suggests that in any such large community, with enough data on individual participants and on how the individual participants collaborate or correlate with each other, we can make predictions about what these users will like in the future based on what their tastes have been in the past, i.e. develop a collaboratively filtered recommendation engine. This, of course, relies on the fact that people’s interests, preferences, and ideologies don’t change too drastically over time.

The two aspects of the (CF) system result in two very different and important results.

The first gives you new, interesting, entertaining, and newsworthy information as judged by the community (in a way this is content that is the average of the interests of the entire community) and a good example of this is Digg’s front page. Not all the content will be directly relevant to your tastes and in fact some of it will be completely irrelevant to you. However, as the community grows and becomes more diverse and independent, the average news story promoted to the front page will be of interest to the average community member. Not satisfied with averages? This is where the second aspect comes into place.

The second aspect of the (CF) system collects information on what kind of content and commentary you like and dislike, and based on your submission and voting habits, it does user-data-profiling. This user profile helps the site recommend content that has been submitted by users (or from sources) you generally agree with and find interesting, as well as topics that you usually vote up and tend to comment on. What this means is that by collecting enough information on how you interact with the site and with other users, the (CF) system can recommend content to you. The system finds the content and deliver it to you rather than it requiring you to scout for it. Furthermore, the more you use the recommendation system and vote up or down, the better it becomes with its recommendations.

The important thing, one that not many social sites realize, is that a (CF) system that doesn’t automatically match content to your preferences, is inherently flawed. The reason for this is simple: Unless you can achieve perfect diversity and independence of opinion, one point of view will always dominate another on a particular platform. The dominant point of view on the social web is a left-leaning one, and without the ability to get the most appropriate pieces of content to the people that care most about them, the right-wing point of view gets buried almost every time.

A perfect example of this was the Ron Paul supporters and the ease with which they were able to manipulate the social news sites. Now if you could match the right-wing viewpoint to the right-wingers, and the left-wing viewpoint to the left-wingers, and get both points of views across to people that are interested in healthy debate rather than partisan politics, you’re getting closer to the ideal system. A filtering system with preference-based recommendations, in essence, is the future of the social web.

Who is using what system?

The (CF) system is without a doubt the lifeblood of the social web. Even though different platforms apply it to varying extents, the system is still there at the core, and the social web would look more like rush hour in downtown Lahore if the community wasn’t actively policing the traffic.

Social News

In the social news space, Digg and Propeller just use the system insofar as the front page is concerned (although Digg is set to release their recommendation engine this week). Once the content is promoted to the front page, the system’s job is done. The system works in that you get rid of spam and unoriginal thought, but it isn’t the best because it relies on averages rather than direct preferences of each participant. While these sites try to catch up and develop recommendation engines of their own, Reddit and StumbleUpon have leapfrogged them for a while now by having recommendation engines in place. These two sites also have similar concepts of a community front page (based on the average interests of the average community member) but they enhance your experience and incentivize increased participation by using your history of likes and dislikes to deliver the most high-quality and most relevant content to you. Furthermore, the normalization of Reddit’s front page shown how a one-front-page-for-all approach forces conformity and dilutes the individual experience, whereas normalization ensures that each user controls how content is distributed to him or her.

Ultimately, even though there are some sites with little or no filtering (Slashdot, Fark, etc.), sites that use their (CF) based recommendation engines will continue to diminish the importance of active filtering from upcoming submission queues and improve the quality of user experience on an individual level.

Video Streaming and Sharing

Online video sites hosting and sharing sites are not much different. Site’s like YouTube have multiple filtering mechanisms that often perform the same functions without requiring votes per se. Viewability, for example, is determined by:

1. Number of people currently watching a video

2. Number of comments on a video

3. User ratings and favorites.

The problem with impressions-based system (like the one used by now understandably dead content aggregator Spotplex) is that just because you viewed something or commented on something doesn’t mean that it’s good. In fact, there are dozens of YouTube videos that I click on, don’t like them and then close the window (I see other people writing negative comments in poor English but I doubt that helps either). Some other sites like Break and Funny or Die use a StumbleUpon-like up/down voting system to determine what gets promoted to the front page. Again, while there are options to view similar/related videos and more videos from a user you like, there is no recommendation system using your rating and favoriting habits (and tags you like).

Blogging and Microblogging

For the most part, blogs use a combination of most viewed, most linked, most commented, and highest rated, as mechanisms for displaying content that you might like. While this is a better idea than letting people go through trial and error, it doesn’t ensure that every visitor will be happy with what they see. For example, two very different posts on two entirely different topics can be the most viewed posts on your blog, and I might like one and not like another. At the same time, one has to wonder, at what point is it economical or time-efficient to start monitoring each individual user?

StumbleUpon solves this problem for the ‘big guys’ by letting you StumbleThru one site for the content that you might like the most. The feature, however, is not available for all sites yet.

Most Microblogging sites, unfortunately, have no filtering system at all. The signal to noise ratio debate rages on with respect to Twitter and its ilk. FriendFeed, however, launched a rudimentary recommendation feature that simple displays the top ‘liked’ and commented links.

Photo Hosting and Sharing

When I was thinking about the concept of (CF) systems, photo-sites like Flickr and Photobucket weren’t even on my radar. Part of the reason for this is how most people I know use these sites, i.e., primarily for hosting and sometimes for finding creative commons images for embedding on their sites. I was, however, quite pleasantly surprised to see that Flickr has gone a long way to help people explore and discover excellent photography.

The feature that most people are probably familiar with is Interestingness. The feature is quite robust. It takes into account things like where the referral traffic to the image is coming from, who is commenting on it and when, who marks it as a favorite and how many people like it, among other more nuanced things. But in addition to that, the site also has other great features such as exploring based on geotagging on a map of the world, popular tags, subject-based and quality-based groups, camera finder, and most recent uploads.

The only thing left to add is a ‘photos you might like’ based on photos you have liked and commented on.

Music Streaming and Discovery

The best implementations of a Collaborative Filtering (CF) system along with a preference based recommendation/discovery system that I have seen are always on music streaming and discovery sites. The implementation on Last.fm for example, is almost perfect in my opinion. First of all, whether you use their online streaming widget or use their desktop software, they monitor every single song you listen to and aggregate that data. They also track how artists jump on and fall off your radar on a week to week basis. They use that data to make specific recommendations and automatically create a radio station for you that plays Last.fm’s recommendations for you based on what you like.

While that in itself is more than enough, they don’t stop there. They have another radio station for you that plays songs you usually like to listen, they show you what the entire Last.fm community is generally listening to, what your friends are listening to, and what your friends are recommending. It is a very robust system for aggregation, filtering, and recommendation. Here’s how the recommendation engine works:

As you can see, they look at the musicians I listen to a lot and then recommend people that are either similar in sound or people who were influencers of or influenced by my favorites. These are followed by recommendations from friends and music-based groups on the site.

So, collaboratively filter and recommend or die?

These are only some of the major players that have embraced (CF) and personalized recommendations – Netflix and Amazon come to mind among others. As you can see from above, it is certainly possible to have a good collaborative filtering system without a recommendation engine (as seen in Flickr). It is optimal, however, for the users (because their experience is better) and your site (because users will participate more often and generally be happy with your product) if you throw in some recommendation system a-la Last.fm, the most robust of the lot by far.

This is a guest post by Muhammad Saleem, a social media consultant and a top-ranked community member on multiple social news sites. You can follow Muhammad on Twitter.




Google Launches Affiliate Advertising Network, Courtesy of DoubleClick

Filed under: Misc — Tags: , , — jeetu @ 10:23 am

Posted at TechCrunch

by Erick Schonfeld

Amazon, watch out. Earlier today, Google launched an affiliate ad network. Or, rather, it rebranded Performics, the affiliate ad network that came along with its purchase of DoubleClick, as the “Google Affiliate Network.” As with other affiliate networks such as Amazon’s, participating Website publishers get paid a fee for each referral that results in a sale. Existing advertisers include Bank of America, Barnes & Noble, Citi, Target, and Verizon.

The service isn’t yet integrated into Google AdSense (publishers and advertisers still have to set up separate accounts), but that would be a logical next step. An integration with AdSense could add a contextual element to the affiliate ads placed through the network. The more relevant Google can make those affiliate links, the more that consumers will actually click through and buy (in theory).

Google also continues to experiment with a pay-per-action advertising program, which is still in beta. At some point, it might make sense to consolidate that effort into the Google Affiliate Network as well.

Update: Google will actually be phasing out the PPA program at the end of August as part of the integration with DoubleClick. You can read more details at the blog post here.

Crunch Network: MobileCrunch Mobile Gadgets and Applications, Delivered Daily.

June 26, 2008

Microsoft To Buy Powerset? Not Just Yet.

Filed under: Misc — Tags: , , — jeetu @ 2:52 pm

Posted at TechCrunch

by Michael Arrington

VentureBeat is reporting that Microsoft has agreed to buy semantic search engine Powerset for somewhere around $100 million, which is the price we previously reported was being offered to the company.

Our sources have been saying this deal is highly likely since May, but hasn’t actually been signed yet and could still be disrupted by the ongoing Microsoft-Yahoo negotiations. Dave Wehner, a Managing Director at investment bank Allen & Co. (he’s the guy who sold Bebo for $850 million to AOL), is representing Powerset in the deal.

Powerset debuted at TechCrunch40 last fall and opened a showcase of its technology to the public just last month.

Powerset has raised around $12.5 million in venture capital, and is rumored to have taken another $8 million or so in convertible debt as bridge financing.

Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.

ClickPass Adds Google, Facebook, Yahoo, And Hotmail To Its OpenID Gateway

Filed under: Misc — Tags: , , , , , — jeetu @ 1:56 pm

Posted at TechCrunch

by Jason Kincaid

Clickpass, a startup that has simplified the OpenID login platform, has built out support for additional third parties that brings the promise of a universal login even closer. Users will now be able to use their Google, Facebook, Yahoo, or Hotmail passwords on any site that includes the Clickpass authentication system.

The new Clickpass system requires almost no effort from the end user. Supported sites simply embed a button on their login page which prompts users to login with their credentials from one of the aforementioned services; you don’t even need to have a Clickpass account. On supported sites, creating a new account is as simple as logging in with your preferred service (I use Gmail), and picking a display name to show other users. This is what OpenID should be.

So what’s the catch? At launch the service only works on a handful of sites, but CEO Peter Nixey says that implementing it on a website is easy – we can expect to see the number of supported sites skyrocket in the next few days. Developers need only implement the standard OpenID protocol along with the Clickpass system and they’re good to go.

One problem that Clickpass will soon face is that it is really a temporary solution to a problem most of these companies are already working on. We can expect Google, Yahoo, and the rest of the lot to implement their own version of OpenID, which will effectively take Clickpass out of the equation.

Crunch Network: MobileCrunch Mobile Gadgets and Applications, Delivered Daily.

MySpace Opens Up The Data Pipe With Full Launch Of Data Availability

Filed under: Misc — Tags: , , — jeetu @ 8:00 am

Posted at TechCrunch

by Michael Arrington

MySpace was the first of the Big Three to announce tools for third party sites to integrate MySpace user data into their services (called, collectively, Data Availability). A day later Facebook announced Facebook Connect, then came Google Friend Connect three days after that.

Today MySpace is fully launching Data Availability (look for it this afternoon at developer.myspace.com), and any third party developer can now build applications using their APIs. Google’s product remains in a test phase with a handful of sites (example), and we won’t likely hear more from Facebook until their F8 conference in late July.

MySpace is taking a much more interesting approach than Google, which controls data sent to third party sites via an iframe. MySpace is actually streaming data to these sites, which allows for true integration between the services, not just a bolted-on social tool.

Developers can access any publicly available profile data from a MySpace user and integrate it into their site. This includes a user’s name, picture, bio, social graph (list of friends), and other information. Users authorize the data transfer via a one-time secure OAuth login to MySpace from the third party service. The service is then allowed to access the data.

Since actual data is being streamed out of MySpace, they have a strict terms of use policy that forbids third party sites from storing or caching the data, other than the unique MySpace user id of the user. Each time a page is rendered the third party must re-request the data from MySpace via a set of APIs. That means any changes by the user to their MySpace profile data or friends list will be instantly applied across third parties who access the data.

Like Google and Facebook, users will be able to revoke access by any third party via a privacy control panel on their MySpace account:



Actual Data Portability, But No Syncing

This is a real move towards data portability, since MySpace is actually allowing data out of its server vault. The fact that third parties can’t store that data isn’t a perfect solution, since MySpace retains ultimate control of it (I discuss this problem in my Centralized Me post). True data portability requires constant syncing of data so that the users remain in control. But until real standards emerge on just how to do that (and there are some big hurdles), MySpace’s approach seems more than reasonable. This is a real step forward in terms of user data rights, and I expect we’ll see a ton of very creative implementations of Data Availability.

We are building a test application now and should have it live within a few hours. Look for lots of implementations over the next few days.

Crunch Network: CrunchGear drool over the sexiest new gadgets and hardware.

June 25, 2008

Idea acquires Spice Telecom

Filed under: Misc — jeetu @ 8:59 pm

This means we’ll now have Idea in Bangalore

Jeetu

Idea has acquired Spice Telecom and the deal consists of 4 transactions:

  • Idea will acquire the Modis’ 40.8% stake in Spice.
  • Idea will launch the mandatory 20% open offer for the Spice shareholders, jointly with Telekom Malaysia International (TMI).
  • Idea will merge Spice with itself and offer a 14.99% stake to TMI through a preferential allotment.
  • The Idea-TM combine will launch the open offer at Rs 77.30 jointly with TMI, which now holds 39.2%in Spice

Idea will earn Rs 7,294 crore ($1.7 billion, assuming an exchange rate of Rs 43) by selling the 20% stake to TMI(making it one of the largest infusions of FDI into India.)

Spice shareholders will get 49 Idea shares (after the TMI preferential allotment)for every 100 shares they hold and the deal is supposed to be over by end of 2008.

The deal gives Idea an entry into the Punjab and Karnataka markets, and Spice’s 4.4 million customers.

The Indian telecom space is heating up with deep pocket players’ entry like Virgin.

Read our complete coverage of Indian Telecom Industry


Related Articles:


Copyright © Ashish Sinha 2007. » Join startup discussion forum || India Business Channel

Plane overshoots Mumbai as both pilots go to sleep

Filed under: Misc — jeetu @ 8:54 pm

Posted at The Times of India

A major disaster was averted when an Air India Jaipur-Mumbai flight flew well past its destination with both its pilots fatigued and fast asleep in the cockpit.

Amazon Gets Some New Threads, Acquires Fabric.com

Filed under: Misc — Tags: , , — jeetu @ 3:45 pm

Posted at TechCrunch

by Jason Kincaid

Internet-giant Amazon has acquired Fabric.com, an online fabric store that calls itself “The Place To Go When You Sew”. According to the press release, the deal will allow Fabric.com to expand its selection of sewing materials while giving Amazon a better catalog of hobby and craft materials. The cost of the deal was not disclosed.

Fabric.com was launched in 1999, and joins a growing list of Amazon acquisitions that includes dpreview, a camera review site acquired in 2007, and Audible, which was acquired earlier this year.

Crunch Network: MobileCrunch Mobile Gadgets and Applications, Delivered Daily.

Older Posts »