When Google Goggles launched over a year ago, I was underwhelmed by a visual search app that seemed to
have been released way before it was genuinely useful.
The standalone app consistently struggled with bar codes and product labels. The results were uneven at best. On Android, Goggles
has its own dedicated version. On the iPhone it is embedded into the Google search app. I was alerted to an iPhone update the other day that caught my eye. The release notes suggested that Google
visual searches now could solve Sudoku puzzles. I haven't a clue if this works, by the way, but only because I have carefully avoided this puzzle craze for years. I don't even know the basic
rules -- let alone what a successfully solved puzzle looks like. "It is very relaxing," my fiancée, an unapologetic math geek, assures me. I, on the other hand, am math-averse. I get
regular notices from my bank that I miscalculated the basic addition on my last deposit slip... by a couple of thousand dollars.
So, I can neither confirm nor deny that Google Goggles
now can solve a Sudoku puzzle. I just aimed the Android version of the app at a Web-based puzzled and it recognized that it was Sudoku and seemed to deliver something resembling a solution.
Google has an entertaining demo of the feature, pitting its technology against a champion solver.
More
to the point, the Sudoku trick is really meant as a broader demo of the evolution of visual search. In fact, the latest Goggles release for the standalone Android app and iPhone Google search app is a
very strong advance over what I saw a year ago. The Android app now has an instant bar code reader that automatically recognizes when a UPC code is in the camera's view and calls up the search
result without even snapping the camera. Some other bar code readers have had this, of course, but the added convenience in Goggles just makes the casual user think of it as an all-in-one shopping
resource.
In fact, while Google may have cornered the market in mobile Sudoku solving, it is also reaching out to cover categories of visual search that Amazon, eBay, Microsoft Tag and
others have been working on. In the well-integrated iPhone Google app especially, Google clearly is trying to become a multi-mode search app that precludes the need for brand-specific solutions.
Product recognition has improved noticeably. Indeed, older, failed scans that were stored in the app's history now deliver more accurate results when I run them again. Google seems to extract from
a given object both image info as well as text to come up with a couple of possible solutions. In the iPhone search app, many of the results on products offered localized results, although these local
retail shopping options usually were few to none. Still, the Google app is clearly headed in the direction of the recently introduced and very good Amazon scan app that delivers online pricing from
multiple vendors on many goods, plus Amazon's own catalog listing.
I was most impressed by Goggles' indexing of magazine ads. Google says it can understand the ads
in most major magazines published after last August. The scanner successfully recognized almost every print ad I could throw at it from recent issues of Entertainment Weekly and American
Heritage. The scans were so sensitive that the camera even captured and understood the cover text "American Heritage" that got into a shot of an EW page.
The
recognition piece of the puzzle has been solved more effectively than the results, however. In the end, Google is still kicking you into a general search result. The HTC phone ad or the Rosetta Stone
language software ad just links to a manufacturer's general Web site -- still not the kind of directed brand experience a user would get (hopefully) from a mobile code program via Microsoft Tag or
JagTag. Amazon and eBay still tend to keep users within their own mobile-formatted experiences, likely getting users close to a buy button faster.
Nevertheless, Goggles is worth a new
look. The art of visual search clearly has advanced significantly in just a year. The next obvious step for this app is to encroach on the nascent augmented reality search services that we already see
tested in apps like Junaio and Acrossair. The most compelling prospect for the Google app is the way in which it is integrating multiple inputs -- text, proximity, voice and visual.
I have
already gotten accustomed to the idea that voice search is remarkably accurate and fast. I rarely type a search entry any more. If the user learns that visual search is as reliable as these other
input methods, then we have a mobile search platform that is as integrated and simple for consumers to use -- as it may be complex and challenging for marketers to master.