April 17, 2003

Throttle that web server!

Overnight I discovered that a certain server down in New Zealand has seen fit to start spidering one of my servers. The idiots. They're spidering one of their own sites, which has an external link to one of mine. Their spider is making it worse by somehow mangling the URLs and recursing into non-existent subdirectories. So I'm seeing all sorts of wasted bandwidth and a generally cluttered-up server log (~512 KB so far).

UPDATE: The admins on the box have contacted me. They've shut down the spider (htdig) and offered an apology. Way to go, folks! Running these machines can be a tough job. I hope they have some luck getting it reconfigured properly.

What to do? Well, using Apache deny directives is a good start. Trouble is, their spider is apparently not alone: another one of their hosts, on a different subnet, is making the same erroneous requests. So if I block entire subnet ranges or individual IP addresses, I'm potentially faced with a lot of work. I'm also faced with legitimate users on those subnets getting blocked along with the spider. I'd heard about mod_throttle some time ago. It's been on my 'to check out' list for ages. Now, it seems, I have need for it.
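To make the trade-off with subnet blocking concrete, here is a minimal Python sketch of what a range-based deny rule does. It is an illustration only, not Apache configuration: the two networks are reserved documentation ranges standing in for the offending subnets (the real addresses are not given here), and `DENIED_NETWORKS` and `is_denied` are hypothetical names.

```python
import ipaddress

# Hypothetical deny list: documentation-range networks standing in for the
# two offending subnets. These are assumptions, not the real addresses.
DENIED_NETWORKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def is_denied(client_ip: str) -> bool:
    """Return True if client_ip falls inside any denied network."""
    addr = ipaddress.ip_address(client_ip)
    return any(addr in net for net in DENIED_NETWORKS)


if __name__ == "__main__":
    # The first address plays the spider, the second an innocent neighbour
    # on the same /24, the third a host on an unrelated network.
    for ip in ("203.0.113.7", "203.0.113.200", "192.0.2.10"):
        print(ip, "denied" if is_denied(ip) else "allowed")
```

The neighbour on the same /24 is denied right along with the spider, which is exactly the collateral damage a range block risks.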

Initial setup looks good. I've got it blocking IP addresses if they request 'too often'. I'm sure I'll have to tweak these numbers a bit, not to mention refine which directories get this treatment and which don't. As I get a better grip on it I'll be sure to report my experiences.
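The general idea of refusing an IP address that requests 'too often' can be sketched as a sliding-window counter. This is a rough Python illustration under stated assumptions, not mod_throttle's actual algorithm or configuration: the `RequestThrottle` class, its method names, and the limit of 30 requests per 60 seconds are all made up for the example.

```python
import time
from collections import defaultdict, deque


class RequestThrottle:
    """Per-IP sliding-window limiter (illustrative, not mod_throttle)."""

    def __init__(self, max_requests=30, window_seconds=60.0):
        # Both limits are arbitrary placeholder values.
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._hits = defaultdict(deque)  # client IP -> accepted timestamps

    def allow(self, client_ip, now=None):
        """Return True if this request is within the limit for client_ip."""
        now = time.monotonic() if now is None else now
        hits = self._hits[client_ip]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False  # too many recent requests: refuse this one
        hits.append(now)
        return True


if __name__ == "__main__":
    throttle = RequestThrottle(max_requests=3, window_seconds=10.0)
    for t in (0.0, 1.0, 2.0, 3.0, 10.0):
        print(t, throttle.allow("203.0.113.7", now=t))
```

Only accepted requests are recorded, so a client that keeps hammering gets let back in as old timestamps expire. A stricter variant could record refused requests too, which keeps a persistent offender locked out for longer.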

Posted in Geek at 01:13 PM