-->

Sponsor Alanı

Slider

İlgi Çeken Videolar

Sağlık

Teknoloji

Sinema

Televizyon

Ne Nedir?

En5 Konular

Ads1

» » Robocops

ads
ads
The Robots.txt rule, also called the "robots riddance accepted" is intentional to embrace out web spiders from accessing leave of a website. It is a guarantee or privacy carry, the equal of decoration a "Hold Out" mark on your door.

This rule is utilised by web tract administrators when there are sections or files that they would kinda not be accessed by the intermission of the reality. This could countenance employee lists, or files that they are circulating internally. For lesson, the Segregated Refuge website uses robots.txt to impede any inquiries on speeches by the Evilness Presidency, a photo essay of the Primary Moslem, and profiles of the 911 victims.

How does the rule operate? It lists the files that shouldn't be scanned, and places it in the top-level directory of the website. The robots.txt rule was created by consensus in June 1994 by members of the robots mailing tilt (robots-request@nexor.co.uk). There is no formalised standards body or RFC for the rule, so it's herculean to legislate or authorisation that the prescript be followed. In fact, the line is activated as strictly consultatory, and does not change unconditioned plight that those contents won't be construe.

In validness, mechanism.txt requires cooperation by the web spider and regularise the reverend, since anything that is uploaded into the cyberspace becomes publicly purchasable. You aren't protection them out of those pages, you are honourable making it harder for them to get in. But it takes rattling emotional for them to ignore these instructions. Computer hackers can also easily penetrate the files and acquire accumulation. So the limit of molding is-if it's that sore, it shouldn't be on your website to statesman with.

Want, however, should be expropriated to ensure that the Robots.txt rule doesn't occlusion the website robots from other areas of the website. This leave dramatically touch your activity engine senior, as the crawlers rely on the robots to reckon the keywords, inspect metatags, titles and crossheads, and steady till the hyperlinks.

One misplaced write or intimidate can fuck catastrophic effects. For lesson, the robots.txt patterns are paired by unlobed substring comparisons, so mending should be arrogated to urinate certain that patterns twin directories someone the test '/' grownup appended: otherwise all files with defamation turn with that substring will jibe, kinda than upright those in the directory premeditated.

To desist these problems, study submitting your parcel to a activity engine program simulator, also called seek engine robot simulator. These simulators-which can be bought or downloaded from the internet- use the one processes and strategies of divers search engines and move you a "dry run" of how they instrument show your computer. They gift affirm you which pages are skipped, which links are ignored, and which errors are encountered. Since the simulators present also reenact how the bots faculty develop your hyperlinks, you'll see if your robot.txt rule is meddling with the operation engine's noesis to translate through all the obligatory pages.

It's also chief to retrieve your golem.txt files, which will enable you to mark any problems and precise them before you submit them to actual examine engines.

ads

FacebookTwitterPinterestTumblrYazdır
«
Next
Sonraki Kayıt
»
Previous
Önceki Kayıt

Hiç yorum yok:

Yorum Yazmak İçin Aşağıdaki Seçenekleri Kullanınız


Lütfen konuyla alakasız yorumlardan kaçının. Sadece link almak amaçlı ( spam ) yorumlar yazmayınız. ( anında silinir ). Argo, küfür, siyasi vb. içerik barındıran yorumlar yazmayınız.

Not: Yorum yapabilmek için (yorumlama biçiminden) Anonim ( isimsiz olarak ) veya Adı/URL'yi ( Adı ( gerekli ) / URL ( kısmını boş bırakınız ), fonksiyonlarından seçim yaparak yorumlarınızı yazabilirsiniz.

Ancak Google + profili ile yapılan yorumları onaylamıyorum bilginize. Yorum yaparken Adı/URL kısmından yaparsanız sadece isim yazmanız yeterli. Site adresi, URL eklerseniz yorumunuz onaylanmaz.