How good does e-discovery search need to be?
Two years ago, CEO Mike Lynch of Autonomy tried to persuade me that Autonomy was and would remain dominant in the e-discovery search market because: Read more
Categories: Autonomy, E-discovery, Enterprise search, Search engines | 1 Comment |
Silly Twitter statistic
In April, the widely respected Louis Gray came up with an uncharacteristically silly idea — the ratio between a Twitterer’s number of followers and total tweets. Recently, Ed Kohler posted about essentially the same thing, without obvious attribution. Gray and Kohler both seem to suggest that the number of your number of followers at any one point in time should be viewed as a guide to how many total tweets you should make over your lifetime use of the service.
Huh?
At least the whole line of reasoning isn’t as bad as another one I recently discovered on the subject of information overload.
Categories: Twitter | 5 Comments |
Dubious statistic of the decade
In a 2006 white paper, IBM claimed that “just 4 years from now, the world’s information base will be doubling in size every 11 hours.” This week, that statistic was passed on — utterly deadpan — by the Industry Standard and Stephen Arnold. Arnold’s post actually reads as if he takes the figure seriously.
Now, I’ll confess to not having seen the argument in favor of that statistic. But color me skeptical that, by any measure of “information”, it will grow by a factor of more than 2^730 in a year, or 2^7300 in a decade …
Categories: IBM and UIMA | 5 Comments |
When homonyms go astray
I can’t tell whether a recent comment to a post of mine is a spoof or not. If it is a spoof, it’s very well done. But if it’s serious, how did that commenter find the thread in the first place??
Evidently I’m a social media expert too. Who knew?
Network World asked me to do an online chat. That isn’t surprising. What’s surprising is that they asked me to focus on social media. My views on social media boil down to:
- Get off the stick and blog!
- Social media are a part of life, especially if you have any valued employees under the age of 40. Get used to it.
- The “dangers” of social media are the same as the dangers of other forms of internet communication. If your employees can’t use email or web surf safely, you’re dead anyway. So stop fretting.
The long form of my views on social media — with a little data warehousing thrown in — may be found here.
In somewhat related news, Jason Fry of the Wall Street Journal showed his exquisite good sense by quoting me carefully about online presence, and expanding upon my points at length.
Categories: Blogosphere, Social software and online media | 1 Comment |
LinkedIn name search is ridiculously bad
Somebody named Conor O’Mahony has posted excellent comments about XML databases on a couple of DBMS2 threads. After a look at the blog URL he provided and the job description he posted there, I resolved to look him up. LinkedIn seemed as good a way as any of figuring out where he was geographically located. But on the first try I typed his name from memory as Conor Mahony. LinkedIn had no idea who I meant.
Once I confirmed that he was indeed listed, I went on to test such errors as Connor Mahony, the very common misspelling of my name as Kurt Monash, and several variations on Dan Weinreb. Almost nothing worked. LinkedIn did get Daniel/Dan, and didn’t require the hyphen in Tony Lacy-Thompson, but otherwise pretty much every misspelling I could think of stumped it. Read more
Categories: Search engines, Social software and online media | 9 Comments |
A startup that could improve all our lives
Apostrophee aspires to hugely improve the experience of cyberspace, by applying grammar and spelling correction to online content, especially blog comments and forum posts.
Too bad the article is a spoof.
Reflecting on why it has to be spoof could be somewhat enlightening. 😉
Categories: Blogosphere, Fun stuff, Humor, Social software and online media | 1 Comment |
Lexalytics has merged with part of Infonic
As reported on the Lexalytics blog, sentiment analysis specialist Lexalytics has merged with the text analytics division of Infonic to form Lexalytics Limited. The deal seems to have a screwy financial structure — which Seth Grimes made a valiant effort to decipher (I think from vacation, poor guy) — as is common when companies much too small to be public wind up trading publicly anyway.
Related links
Categories: Lexalytics, Sentiment analysis | Leave a Comment |
Google vs. Microsoft search, per Seth Grimes
Seth Grimes did a head-to-head comparison of Google and Microsoft Live Search results about the Microsoft/DATAllegro deal, 10 hours after it was announced. He found that Google had picked up a number of relevant results, while Live Search hadn’t. (And this was on the main search pages, not on News or Blogs.) He goes on to note that Yahoo’s “contextual” ads were badly irrelevant (Google didn’t have any at all).
What this boils down to, mainly, seems to be a major win in spidering speed for Google vs. Microsoft Live Search.
And yes Seth — I like you too. 🙂
Categories: Google, Microsoft, Search engines | Leave a Comment |
It’s too early to go back to Twitter
For a while, I’ve made very little use of Twitter. The reasons are familiar:
- COMMON outages.
- Temporary disabling of the Replies feature.
- General lessening of the discussion, as other people stay away too.
Back from vacation, I just tried again. My experiences include: Read more
Categories: Twitter | Leave a Comment |