Page Menu
Home
c4science
Search
Configure Global Search
Log In
Files
F93786069
PhabricatorRobotsController.php
No One
Temporary
Actions
Download File
Edit File
Delete File
View Transforms
Subscribe
Mute Notifications
Award Token
Subscribers
None
File Metadata
Details
File Info
Storage
Attached
Created
Sun, Dec 1, 12:08
Size
1 KB
Mime Type
text/x-php
Expires
Tue, Dec 3, 12:08 (1 d, 23 h)
Engine
blob
Format
Raw Data
Handle
22704442
Attached To
rPH Phabricator
PhabricatorRobotsController.php
View Options
<?php
final
class
PhabricatorRobotsController
extends
PhabricatorController
{
public
function
shouldRequireLogin
()
{
return
false
;
}
public
function
processRequest
()
{
$out
=
array
();
// Prevent indexing of '/diffusion/', since the content is not generally
// useful to index, web spiders get stuck scraping the history of every
// file, and much of the content is Ajaxed in anyway so spiders won't even
// see it. These pages are also relatively expensive to generate.
// Note that this still allows commits (at '/rPxxxxx') to be indexed.
// They're probably not hugely useful, but suffer fewer of the problems
// Diffusion suffers and are hard to omit with 'robots.txt'.
$out
[]
=
'User-Agent: *'
;
$out
[]
=
'Disallow: /diffusion/'
;
// Add a small crawl delay (number of seconds between requests) for spiders
// which respect it. The intent here is to prevent spiders from affecting
// performance for users. The possible cost is slower indexing, but that
// seems like a reasonable tradeoff, since most Phabricator installs are
// probably not hugely concerned about cutting-edge SEO.
$out
[]
=
'Crawl-delay: 1'
;
$content
=
implode
(
"
\n
"
,
$out
).
"
\n
"
;
return
id
(
new
AphrontPlainTextResponse
())
->
setContent
(
$content
)
->
setCacheDurationInSeconds
(
phutil_units
(
'2 hours in seconds'
))
->
setCanCDN
(
true
);
}
}
Event Timeline
Log In to Comment