Search Engine Quality

Urgent update meta title

April 28, 2010 · Leave a Comment

Organisations often need to rapidly communicate to end users via the web. If a flight crashes, product is recalled, emergency at the comapany etc….people hit the web first. An ideal function would be the ability to flash update meta desc and title and have search engines flush their cached title and snippet. This means even if the corporate site goes offline under traffic load the serp will display a useful update within the snippet and title. The URL could even be redirected or shown in an emergency sitelink. Thus an airline that has lost a flight would go from a serp entry of:

TITLE: Acme – cheap flights to new York and Chicago.
SNIPPET: Air Acme is the northwests leading low cost airline with superior legroom and free seat reservations
URL: www.airacme.com

TO:

TITLE: Air Acme: Flight 345 NYC -ORD Crash. All other flights safe but grounded.
SNIPPET: Emergency hotline 1 800 000 1234 Air acme. 1 800 367 4566 JFK. 1 800 222 5678 ORD.
Www.Airacme.net/emergency

This ability would be used infrequently but allow search engines to play a better role in assisting organisations and the effected people in crisis. A ping submit used in conjunction with the urgent update tag / flag on the organisation site as the first thing to fall in most crisis situation is often the website. This was what we saw on our web monitors in 9/11 and many times since, leaving families scrabbling for information.

Currently you would have to use PPC to get this turnaround time.

→ Leave a CommentCategories: Uncategorized
Tagged:

Search Quality Evaluator

January 4, 2010 · Leave a Comment

Last year I mentioned to a client in passing that algorithmic results are evaluated and improved by the feedback from Search Quality Evaluators.

The client, a programming background CTO of a well-known brand, was horrified that I dare suggest that search engines have HUMANS doing anything vaguely related to improving search engine results. Despite politely mentioning that I had managed a team of Search Quality Evaluators he remained unconvinced and repeated algorithm in a semi chant as if to banish the notion of humans improving algorithmic processes….amusing since humans clearly create the rules for ranking.

So for those who remain unconvinced, here is a job posting for Google Search Quality Evaluators in India.

Search Quality Evaluator

Search Quality Evaluator

Here is a LinkedIn to a chap who is a contract SQE in India: http://in.linkedin.com/pub/anil-yamarti/16/425/a6a

I may even find a team photo of my old Search Quality Group some time.

While the job title ‘Search Quality Evaluator’ can cover many types of task (12 distinct tasks in my old Ask.com team) the aim is generally the same, to try to avoid one-off changes (a VP’s restaurant venture not ranked in top 5) and to find patterns that will yield significant wide scale improvements and allow refinement to automated processes. Not to mention that hunting down Web Spam, hate, incite, illegal material etc is a full-time job.

Typically when an algo update is made, a team of SQE’s will run diagnostics and while much of the work is automated, the results still need evaluation and also human evaluation of a set % of pages means very well constructed spam doesn’t sneak by as well a vital human qualitative judgement comes into the mix.

True, it can be a very dull task at times, with huge pressure from engineering to review/approve a releases but equally when my team found new types of spam, it could be very rewarding to remove 350,000 pages from the index in a single hit or see a tweak to the ranking rules drop lower quality blogs in favour of slightly older but higher quality websites.  Removing illegal material feels palpably like you’ve made the world a better place.

For the CTO who felt human evaluation of search results was not scalable he was correct but only as he probably understood the role

But it is not an issue of looking at every page, but looking at classes of data, patterns and designing new tests to identify quality issues such as freshness, adult material, radical material, spam etc.

Our critics other assertion was ‘No Google engineer would listen to a quality rater who isn’t an engineer himself’. On this point I suspect he is plain incorrect as the comment belies a fundamental belief that engineers cannot exist outside of teams writing the core code. Now while I cannot speak for Google, I know my Ask.com teams saw the relationship as co-dependant rather than subservient. In my core area of heuristic search I remember many times  when a quality evaluator would draw my attention to needed improvements to patterns I created. I never refuted their comments purely on the basis that they didn’t create the search pattern or understand the vagaries of search architecture.

While Search Quality Evaluators don’t get much press, they are certainly an extremely valuable function in any search engine, every time your search results improve you probably have the Search Quality team at your favoured search engine to thank.

→ Leave a CommentCategories: Search Quality
Tagged:

Duplicate Content Problems

January 4, 2010 · Leave a Comment

Matt Cutt’s Google videos get plenty of traffic, but there are plenty of great videos from other members of the Google Search Quality Team.

Greg Grothaus of Google’s Search Quality Team gives a great run through on Duplicate Content issues.

http://googlewebmastercentral.blogspot.com/2009/09/duplicate-content-and-multiple-site.html

Many duplicate content issues creep in way ahead of SEO teams being involved and are baked in to site architecture. As part of improving search results (rather than SEO per se) it’s important to be aware of the dilutive effect of duplicate content and lack of canonicalisation. Dupe content issues should be part of every web developers formal (or informal) training if we want to see less problems with large, international websites.

Go To: Good Google video on Duplicate Content Issues http://tinyurl.com/dupe-content

→ Leave a CommentCategories: Web Development
Tagged:

Twine Search Engine Poor Relevancy

January 3, 2010 · Leave a Comment

In March 2008 Twine was reviewed poorly by Marshall Kirkpatrick of Read Write Web. As we enter a new decade it seemed like time to kick the tyres again. Unfortunately Twine is still offering a poor search experience and also seems to have fallen foul to spam results. Of the 5o queries I tried (from San Jose, CA) most timed out. The few results Twine provided demonstrated poor relevancy to say the least. Below is a screenshot of Twine results for a fairly easy to handle [flights to NYC] which results in results for Lagos, Nigeria.

twine search engine

twine search engine

Most queries were answered by a long delay and followed by an error page:

Twine Fails 2

Twine Fails 2

Unless things start happening rapidly down Twine way, I suspect Twine will be no more than an interesting search concept that failed to gain scale or achieve relevancy.

→ Leave a CommentCategories: Search Engines
Tagged:

Skype Web Spam

January 3, 2010 · Leave a Comment

Act like a spammer, get treated like a spammer. www.rxrefillshop.net is one of many websites that try to spam Skype users.

An interesting test would be to set up a 100 Skype users with predictable common names (john smith etc) on virtual machines and monitoring the URLs received in messages. I suspect there would be few false positives in the spammy URLs that would be received. These URLs would then be added to a potential spam database. Methods of spam identification that track IM and Skype would also disincentivise spamming of these communication channels.

A relationship between Skype and Search engines would be even more efficient, whereby URLs send more than n times to people not in the users contact book  (and similar spam signals) would have the URL automatically flagged and communicated  to search engine spam teams.

Skype Webspam

Skype Webspam

→ Leave a CommentCategories: Web Spam
Tagged:

Improved Pharmaceutical Website Identification

January 3, 2010 · Leave a Comment

Search Quality Concept: Increased peace of mind for (often elderly) web users by flagging official Pharmaceutical websites.

Ideally Pharmaceutical websites would be flagged in organic search results when they are the official manufacturer website. The simple example below shows how a user could be informed that the site is official. Whilst an imposter site is unlikely to be ranked high for the trademark [Cymbalta], the risks increase as query length increases. A search for [Cymbalta safe dose] shows a spam site at SERP position 8 (eilfqmcegmt.freehostia.com/herbal-vf9/waler.html). 

Improved Pharma Site Identification

Improved Pharma Site Identification

→ Leave a CommentCategories: Search Quality · Search Relevancy
Tagged: ,