A Million Search Apps

January 28, 2011

Curation is the New Search is the New Curation

Filed under: Uncategorized — Shree Pragada @ 11:35 am

Here is an exciting post by Paul Kedrosky on how curation will be key to future search. In the past, when the web was only a couple millions of pages one company like Yahoo could curate a directory of the web. One company couldn’t  curate as the web grew to billions of pages. Now, algorithmic search is also failing to deliver. And, the solution could be curation again. With billions of pages, current web will take more than one company to curate the web. Current web will need a large community curating small focused areas of web together covering a vast majority of the web — just as Semantifi envisions that the future web will be powered by millions of community build/curated search apps covering the broad spectrum of the web.

Here is the link or the complete article:

http://paul.kedrosky.com/archives/2011/01/curation_is_the.html?utm_source=feedburner&utm_medium=email&utm_campaign=Feed:+InfectiousGreed+(Paul+Kedrosky’s+Infectious+Greed)

In the beginning there was curation, and it was good. People found interesting things on the web, created directories of those things, and then you found what you were looking for inside those curated lists. That was the origins of the original lists and directories, from Yahoo on outward.

But then that got too hard. The web got bigger faster than anyone could keep track. Curation steadily gave way to algorithmic search, which at first was just spidering of the web, and then more intelligent spidering with keywords. And then it became Google, with ranking algorithms that placed websites into a hierarchies of keyword-related relevance based on things like authoritativeness, as defined, in part, by links from other sites — by those original hand-curated lists, ironically enough.

That model has now begun to give way too. Any algorithm can be gamed; it’s only a matter of time. The Google algorithm is now well and thoroughly gamed, as I first wrote about late last year, and as now become an entire genre of web writing, and that has grown to include my friend Vivek Wadhwa’s smart piece on TechCrunch not long ago. Google has, they argue, lost its mojo — which is true, but it’s more interesting and complicated than that.

What has happened is that Google’s ranking algorithm, like any trading algorithm, has lost its alpha. It no longer has lists to draw and, on its own, it no longer generates the same outperformance — in part because it is, for practical purposes, reverse-engineered, well-understood and operating in an adaptive content landscape. Search results in many categories are now honey pots embedded in ruined landscapes — traps for the unwary. It has turned search back into something like it was in the dying days of first-generation algorithmic search, like Excite and Altavista: results so polluted by spam that you often started looking at results only on the second or third page — the first page was a smoking hulk of algo-optimized awfulness.

There are two things that can happen now. (Okay, three. We could stop search, which won’t happen.). We could get better algorithms, which is happening to some degree, with search engines like Blekko and others. Or, we could head back to curation, which is what I see happening, and watch new algos emerge on top of that next-gen curation again. Think of Twitter as a new stab at curation, but there are plenty of other examples.

Yes, that sounds mad. If we couldn’t index 100,000 websites in 1996 by hand, how do we propose to do 234-million by hand today?

The answer, of course, is that we won’t — do them all by hand, that is. Instead, the re-rise of curation is partly about crowd curation — not one people, but lots of people, whether consciously (lists, etc.) or unconsciously (tweets, etc) — and partly about hand curation (JetSetter, etc.). We are going to increasingly see nichey services that sell curation as a primary feature, with the primary advantage of being mostly unsullied by content farms, SEO spam, and nonsensical Q&A sites intended to create low-rent versions of Borges’ Library of Babylon. The result will be a subset of curated sites that will re-seed a new generation of algorithmic search sites, and the cycle will continue, over and over.

In short, curation is the new search. It’s also the old search. And it’s happening again, and again. [-]

[Update] A friend points out in IM that all of this makes Yahoo mothballing Delicious, a directory of curated lists, more than a little mistimed. And it’s made pointed and ironic too when you look at what the #2 most bookmarked link is on Delicious right now: Google’s weakening search results

June 22, 2010

Government Transparency: Beyond just putting Government Datasets on the Web to making them Searchable!

Filed under: Uncategorized — Shree Pragada @ 1:29 pm

Returning from Gov 2.0 Expo in Washington got me more interested in this budding Government Transparency phenomenon. At this time it is mostly a lot of buzz and very little tangible progress. Here are my two cents on what it takes to really catalyze Government Transparency:

President Obama campaigned on the vision of Government Transparency and made it a priority for his administration. Last year his CIO, Vivek Kundra, launched DATA.GOV to begin putting government datasets on the Web.

Making government datasets available on the Web as “raw datasets” is a good first step. However, raw datasets are not accessible to the millions of Citizens that they are intended for as manipulating raw datasets can be very time consuming and technically challenging. For government datasets to be truly transparent, they must be searchable on the Web.

We understand the administration’s goal to help citizens become better informed and to better engage them in public discourse. Semantifi can help with this goal by enabling citizens socialize government data & insights using its “free data search platform” at www.SEMANTIFI.com.

Unlike Google, Bing and others that search Web Pages, Semantifi searches structured data and engages the community to publish datasets to make them searchable. Users can ask simple questions to search all published datasets, get meaningful answers in the form of charts & tables and share their observations & insights.

Launched earlier this year, Semantifi.com already hosts many popular government datasets covering Government Spending, Recovery, US Economic Metrics, SEC Filings, Earmarks, Recovery, US Aid, FDIC, US Census, etc.  At Semantifi.com, users can

  • Explore Recovery data with questions likes “Jobs recovered in New York and California”, “Award Amount by Recipient State and Performance District”
  • Discover government spending with questions like “Federal Funding to California versus New York”, “Federal funding amount for quarter 1 year 2008”
  • Research over 22,000 US Economic Metrics with questions like “Unemployed and Housing Starts for last 60 months”, “Business Loans between Jan 2005 and Dec 2009”
  • Investigate SEC Filings of publicly traded companies asking questions like “Sales and Income of Amazon and Best Buy”, “Net Sales of companies with Market Capital over 1 billion”

We believe Semantifi can catalyze the administration’s vision of government transparency by making vast number of government databases searchable using pioneering search technology and engaging a community of citizen publishers.

January 11, 2010

Publisher Console soon to be available. Check out & Suggest Apps Ideas.

Filed under: Uncategorized — semantifi @ 11:13 am

We recently added a page for  App Ideas to our Wiki section of the website.

http://wiki.semantifi.com/index.php/App_Ideas

This page illustrates the kind of Apps anyone can build using Semantifi Search Platform. Since it is a Wiki page please feel free to add your ideas for potential Apps. Cheers.

December 10, 2009

Is One Search Engine Enough? Not By a Long Shot.

Filed under: Search,Uncategorized — Shree Pragada @ 8:34 pm
Tags:

According to Digvijay Lamba of Kosmix, “for search to be viable as a platform in the long-term, it must offer value to everyone involved: the platform creator, the application providers, and the users. The platform creator and application providers need to make money, obviously, while at the same time offering consumers high-quality, easily accessible content for free.  This can be achieved in one of three ways:  The Amazon Model, the eBay Model, or the Facebook model.”

Later he adds, “All three of these models have merit, and it remains to be seen which direction a search platform will take. One thing remains clear: as search evolves it will become harder and harder for one company to do it all. A platform that connects hundreds of search engines together can become a powerful source of innovation allowing it to build a deeper and richer experience for its users. The innovation in search has only just started. We need millions of connected search engines.”

Click here to see the full article.

We at Semantifi believe that this philosophy it is especially true of Deep Web Search which is orders of magnitude larger than the surface Web and hence no one company can search it all. Semantifi has just begun to search this vast Web by

  1. building a platform like App Store by hosting its search technology online for anyone to use to build “Data Search Apps” for free
  2. engaging a community of App Publishers, and
  3. creating value for everyone involved: the platform owner, publishers and users.

With Semantifi as the platform owner, a diverse community of publishers and enough number of Apps covering most web verticals, the structured content can be wired together one dataset at a time making Deep Web Search a reality.

December 9, 2009

Congratulations, ExeCue team. We are selected as a Top Innovator at New England Venture Summit 2009

Filed under: Uncategorized — semantifi @ 2:35 pm

I returned last night from Boston after a good event at New England Venture Summit 2009. Our presentation went well. In short, pretty much everyone I spoke to liked our idea and the opportunity of Deep Web Search. A few called it ambitious. And, no one said it can’t be done.

I will post the presentation later today. And, the video as soon as I get hold of it. Once again, congratulations on being selected a Top Innovator. Cheers!

October 4, 2009

Welcome to SEMANTIFI!

Filed under: Uncategorized — semantifi @ 3:21 pm

Post your comments here.

Theme: Rubric. Blog at WordPress.com.

Follow

Get every new post delivered to your Inbox.