Archive for the 'Google' Category

Immunity on Duplicate Content

on Tuesday, June 5th, 2007

I’m trying to understand how google handles the duplicate contents and wondering why some sites are luckier than others, almost having an immunity for duplicate content issues.
Some collective sites like answers.com is seems to be immune to the duplicate content penalties.
See this page about smokey bear(PR4). Correct me if I’m wrong but all I’m seeing is a rip of wikipedia page(PR6) along with lots and lots of advertisements.
Who can really say that this page benefits surfers when wikipedia page is exactly the same thing. I guess answers.com is taking this duplication tactics to the roof. Google is also helping them to drive traffic by hardlinking to their content as the “definition” reference.
I’m expecting no comments from Matt Cutts :), just thinking laudly.

Missing categories in Google’s Directory

on Monday, June 4th, 2007

I was checking for the Free Content category on DMOZ.org and I saw it’s updated. I also tried the check the google version. Big surprise! The category does not exist in google’s dmoz copy.

I’m not sure if this is on purpose, but it’s most likely that google is also moderating the biggest moderated directory on the net. Hehe that’s a good one.

See it yourself (I’m not sure for how long this links will be alive)
Dmoz link : updated around June 4th
Google’s Link: doesn’t exist as of June 5th

Also I’ve found if you search for url that exist on dmoz’s certain category, google returns a result but the link is simply for oh for.
See the first link.

Last notes: Yahoo’s directory doesn’t have the category too. I kind of understand it since the category is under Business branch and it’s a paid service for yahoo to appear under that branch.

Sitemaps in the robots.txt Happy Harvesting

on Wednesday, April 11th, 2007

I’ve just read the Google Webmaster’s blog about the news on ask.com supporting Sitemaps.org’s sitemap format.
This is really a great news for all the people that like to be crawled faster and acurately.
For me the more interesting part about this news is that sitemaps.org’s proposal to include sitemaps into the robots.txt.

Simply you add a line into your robots.txt saying

Sitemap: <sitemap_location></sitemap_location>

This part is really cool but for site harvesters it’s an unbelivable tool. So you can handover the key to your site and web harvesters can crawl your site really easily because probably you’ve put all your site’s pages into your sitemap.

Sounds like a good plan in an ideal world. With all the cloakers and content scrapers you must be really smart not to be ripped apart.

My suggestion is to know who you’re serving the sitemap. Currently Google, Yahoo and Ask is supporting this sitemaps.xml and no other site has anything to do with it.
Here is a simple check you can add in the begining of your sitemap thing:

< ?php
    function botIsAllowed($ip){
        //get the reverse dns of the ip.
        $host = strtolower(gethostbyaddr($ip));
        $botDomains = array('.inktomisearch.com',
                                     '.googlebot.com',
                                     '.ask.com',
                             );

        //search for the reverse dns matches the white list
        foreach($botDomains as $bot){
            if (strpos(strrev($host),strrev($bot))===0){
                $qip= gethostbyname($host);
                return ($qip==$ip);
            }
        }
        return false;
}

if (!botIsAllowed($_SERVER['REMOTE_ADDR'])){
    echo "Banned!";
    exit;
}
?>

I’m sure everyone can get the idea of reverse dns and forward dns checking.
If I missed any decent site that uses the sitemaps let me know.

Note: If you’re still using static sitemaps (!) you can just include the xml after the code.

Google Base Banned Keywords

on Sunday, March 11th, 2007

I started submitting some items to google base using the API.
It was all going fine until I realized some keywords were banned from the description, title or URL.
I was posting the free articles I have, about various subjects and my purpose was not spamming.
It’s kind of weird if you want to post a news about anti-online gambling and to get banned.

It’s a stupid keyword match, not a context match. This means you can’t really post anything about those keywords even if you’re writing against or for.

Here is the list I’ve found that is banned. Maybe there are more.

Update: I’ve found this one which is really weird: iraqi dinar

Read more…

New Ad Presentation from Adsense

on Thursday, December 21st, 2006

I realized a new presentation of “Ads by Google” thing on the adsense advertisments
Looks really niceadsense new

Google Wins By Losing

on Wednesday, November 1st, 2006

Interesting article on googles non-search products and how they’re expanding their business.

Some of Google’s non-search projects are really extensions of its search monetization, and are likely to succeed. But others projects mean entering areas where Google doesn’t have much experience, and is taking a risk. With regard to those riskier areas, the key question for Google’s future is whether it can realize that losing is really one of the best assets the company has.

Read more on:
mediapost

eBay bans Google checkout

on Friday, July 7th, 2006

It’s a surprising news that eBay is banning Google checkout.
Are they too scared to compete with Google.
Checkout the accepted payments list on eBay.

Payment Services permitted on eBay: Allpay.net, Bidpay, Canadian Tire Money, cash2india, CertaPay, Checkfree.com, hyperwallet,com, Moneybookers.com, Ozpay.biz, Paymate.com.au, Propay.com, XOOM

Payment Services not permitted on eBay: AlertPay.com, anypay.com, AuctionChex.com, AuctionPix.com, BillPay.ie, ecount.com, cardserviceinternational.com, CCAvenue, ecount, e-gold, eHotPay.com, ePassporte.com, EuroGiro, FastCash.com, Google Checkout, gcash, GearPay, Goldmoney.com, graphcard.com, greenzap.com, ikobo.com, Liberty Dollars, Moneygram.com, neteller.com, Netpay.com, Nochex.com, paychest.com, payingfast.com, Payko.com, paypay, Postepay, Qchex.com, rupay.com, scripophily.com, sendmoneyorder.com, stamps, Stormpay, wmtransfer.com, xcoin.com

IE7 did it again

on Wednesday, February 8th, 2006

It’s a big surprise for me to see Google Adsense is not working on IE7.
It’s because IE7 is blocking third party javascripts. It’s really a matter of Microsoft to decide to block Google.
Maybe Google Adsense will re-write the Adsense code to show it as a local javascript not to loose their 2 billion share in the pay per click scene.
And this will affect other ad programs like chitika, clicksor or else.
Say goodbye to your javascript ads if this feature on IE7 becomes official.
However it’s only in Micro$oft’s hand to do an exception for certain javascripts to run cross domain, but I don’t guess they will do that favour to Google any soon.

Google AdSense Account Disabled ?!?!!!

on Saturday, January 21st, 2006

It was a regular day that I checked my google adsense account and I saw that message:

No AdSense Account
An AdSense account does not exist for this login, as you have not yet completed an application. If you are using your AdWords login, you can create an AdSense account with the same login.

I was really confused, I checked my emails an the bad news: Google AdSense Account Disabled.
Read more…

Google is backing up Firefox

on Tuesday, November 29th, 2005

It seems the guys at Google backing up Firefox very well.
I’ve just seen the new option in Google Adsense for new Firefox banners.

So go ahead and download then install…
Paysback $1.00 to the publisher. ;)