Search engines
Analysis of search technology, products, services, and vendors. Related subjects include:
Google is idiosyncratic about what it displays
I was testing the new blog theme installed on Software Memories, specifically to see whether the title and description in the search engine results reflected the metatag title and description I’d just put in, which are
History of the software industry, its companies and its personalities
and
History of the software industry by Curt Monash, who’s been in the middle of it since 1981
respectively.
Well, the answer turns out to be a resounding “Yes and no.” Read more
Categories: Google, Search engine optimization (SEO), Search engines | Leave a Comment |
How is YouTube relating videos?
One of the great music videos of all time is Madonna’s Material Girl. With two exceptions, all the “related videos” listed by YouTube are just what one would expect: either other Madonna videos, or other versions of Material Girl. One exception is Cyndi Lauper’s Girls Just Want to Have Fun, while the other is Marilyn Monroe’s Diamonds Are A Girl’s Best Friend. The connection with the Monroe video is particularly strong, with each being #3 on each other’s “Related” list.
And that’s an outstanding result. Material Girl is obviously a direct reference, conceptually and visually, to Diamonds Are A Girl’s Best Friend. So my question is: How does YouTube know that? Are there favorite videos lists on which they co-exist? Did somebody hand-enter the connection? Is it inferred from their comment threads (which I definitely have not paged through)? Or — by far the least likely but most interesting of all — is there some sort of direct visual comparison?
Other than popularity presumably having something to do with it (both videos are, deservedly, very often watched and commented on), I haven’t figured out which it is.
Categories: Audio and video search, Google, Search engines | 4 Comments |
Powerset is mildly interesting
Powerset has done a great job of generating buzz for it’s version of smart search. That said, its current demo is mediocre — and that’s being polite. Powerset currently indexes little more than just Wikipedia, and the quality of its search results is about comparable to that of Wikipedia’s justly reviled internal search engine. To determine this, I did searches on both sites on five strings. Wikipedia typically had more total junk ranking higher, but it also put the very best hits of all higher than Powerset did. The strings were:
- Drosophila research
- Bill Clinton foreign policy
- Home run hitters
- Innocents on death row
- Text data mining
Categories: Powerset, Search engines, Structured search | 4 Comments |
Google could dominate single-site search
Google has begun to introduce a feature whereby, if your search obviously leads you to a single site (e.g., you searched on a company name), you get a second search box to search only within that site. More details at Google and Search Engine Land. Basically, this is Google Site Search made a lot easier to use.
I think this could be a really big deal. Read more
Categories: Enterprise search, Google, Search engines, Specialized search | 4 Comments |
Code search options
Questions come up here from time to time about code search engines, a subject I have not researched. Well, here’s a quick link listing some leading code search engines, both Web (guess who?) and internal. Most interesting may be that the list is so short.
Categories: Search engines, Specialized search | Leave a Comment |
Survey of search UI alternatives
I’ll confess to skimming rather than reading this long, footnoted discussion of search user interfaces. But if you need to design those things — even internally at an enterprise — it’s worth at least a quick look. Ditto, perhaps, if you design other analysis- or research-oriented UIs.
Categories: Enterprise search, Search engines | Leave a Comment |
Coveo highlights
I talked yesterday with enterprise search vendor Coveo. Here are some highlights.
- Coveo spun out of Copernic a few years ago. The only relationship between the companies now is that Coveo licenses Copernic’s desktop search product.
- Coveo has 60 employees.
- Coveo has 5-600 customers, including lots of big-name companies.
- Coveo’s pitch boils down to “inexpensive, easy to install, and no-apologies functionality.” Actually, Coveo also claims superior relevance and performance, but I’m not going to comment much on those until I have a chance for a more technical discussion.
- Example of ease of set-up: Coveo says Factiva downloaded the product on a Monday, called up and bought it on Thursday, and deployed it in production that Friday. This may be a growing industry trend. Attivio also features a “download first, talk to us second” distribution model. So do vendors of other kinds of “platform” software such as database management systems, application servers, or complex event/stream processing.
- Average selling price: $50K. Everything is included for one price unless it requires bundled third-party software (as is the case for audio, video, and OCR search).
- Coveo claims 90% head-to-head win rates vs. Google OneBox and Microsoft Sharepoint search. Generally, customers have other search products too (I guess that’s obvious, since Coveo has only been around 2-3 years). Sometimes they even have all-you-can-eat licenses to competitive products, but buy from Coveo anyway. Rule of thumb: Nobody’s head-to-head win rate is truly as high as they like to think, but companies that think their rate is 90% generally are doing quite well.
- Coveo cites a strong demand for text search of relational databases. Based on specific examples cited, this seems to mean text fields such as call center notes.
- Coveo offers audio/video search. Really, it’s just an audio search technology; what’s being searched on in videos is the audio part. And the audio search boils down to a speech-to-text transcription, with a search of the resulting text. Coveo’s key claim is that the error-laden text you get from speech-to-text conversion is sufficient for useful searching. Specifically, you do best searching for unusual words, such as proper names. In the case of telephone calls, which are low quality – perhaps 32 kb/sec – Coveo says there’s only 10-20% accuracy in word transcription. However, Coveo also says that the words that do come through are exactly the unusual ones most usefully searched on.
- Coveo also says that its speech-to-text lexicon is initially strengthened by text crawls. In general, while I didn’t ask, I would guess that the easy-installation story involves a fair amount of automated lexicon enhancement.
Categories: Audio and video search, Coveo, Enterprise search, Search engines | 5 Comments |
Microsoft, Yahoo, and innovation
Bill Burnham argues that a Microsoft/Yahoo merger would drive down M&A prices. Marc Andreesen disagrees. His argument is essentially twofold:
- Microsoft and Yahoo were never more than a small part of the exit opportunity anyway.
- A merged Microsoft/Yahoo will be so slow-moving it will create more opportunities for competition than it destroys.
Andreesen certainly knows about slow-moving behemoths making wasted acquisitions; Netscape was acquired by two companies (AOL and Sun) that both dribbled away the parts they respectively acquired.* However, I think he and a lot of other observers are missing something this time — the Microsoft/Yahoo synergies are too large to ignore.
*The legalities of the merger were a lot more complicated than that, but in essence AOL got the “internet” piece of Netscape and Sun got the enterprise side.
Given the opportunity, here are some reasons I think integration would go a lot better than most people think: Read more
Categories: Enterprise search, Microsoft, Search engines, Social software and online media, Yahoo | 1 Comment |
19 Microsoft/Yahoo synergies that could revolutionize the Internet
Many – perhaps most — commentators on Microsoft’s bid for Yahoo are thoroughly missing the point. The most interesting part of Microsoft’s bid for Yahoo isn’t the horse-race retrospective “How did they screw up so much as to need each other?” It’s not the incipient bidding war for Yahoo. And it’s certainly not the antitrust implications.
The Microsoft/Yahoo combination could revolutionize the Internet. I’m serious. The opportunities for huge synergies might just be enough to blast the merged companies out of their current uncreative, Innovator’s Dilemma funks. Search is open for radical transformation in user interface, universal search relevancy, Web/enterprise integration, and just about everything to do with advertising and monetization. Email stands to be utterly reinvented. Portals and business intelligence have only scratched the surface of their potential. And social networking is of course in its infancy.
Here’s an overview of where some synergies and opportunities for a combined Microsoft/Yahoo lie. Read more
Categories: Enterprise search, Google, Microsoft, Search engines, Social software and online media, Spam and antispam, Website filtering, Yahoo | 15 Comments |
Implications of Microsoft’s bid for Yahoo
As I write this, Microsoft has just announced an offer to acquire Yahoo. Early responses from the likes of Danny Sullivan, Henry Blodget, the Download Squad, TechCrunch, Raven SEO, Mashable, and others seem to boil down to:
- Wow.
- Both sides needed it.
- Yahoo wasn’t going anywhere fast on its own.
- Microsoft wasn’t going anywhere fast in search on its own.
- This may be enough critical mass to matter.
- Conference call at 8:30 am
I’ll try to be a bit more analytical than that, but this is still going to be quick. Assuming the deal goes through:
- Microsoft will recombine both parts of the old FAST/alltheweb.com Therefore, Microsoft will be able to use the same technology for web and enterprise search, to the extent that such commonality makes sense.
- I’d expect Microsoft to try to differentiate its technology via faceted/structured search. That’s a FAST strength.
- The old FAST search-as-BI dream might become pretty appealing to Microsoft/Yahoo.
- In a non-search point, Microsoft is strong in games and Yahoo is strong in fantasy sports. Look for some synergies.
- There sure would be a whole lot of non-Windows technology inside Microsoft. 🙂
Basically, Microsoft is a company that’s a lot more sophisticated in its thinking about user interfaces and experiences than Yahoo is. That’s where the really interesting competitive innovation would be most likely to occur.
Categories: Enterprise search, FAST, Microsoft, Search engines, Structured search, Yahoo | 6 Comments |