[linux-elitists] elitism vs. laziness

Aaron Lehmann aaronl@vitelus.com
Tue Feb 13 23:29:15 PST 2001


On Tue, Feb 13, 2001 at 10:50:15PM -0800, Don Marti wrote:
> I'm trying pdnsd on my laptop, and it Works Great For Me(tm) so far.
> Thanks to Raghavendra Bhat for mentioning it on this list last week.
> No issues with the debian package, which is how I install almost all
> software now. Damn, I'm a lazy bastard. Still, it's going to get a
> filler item about it written for LJ after I kick it around a little next
> road trip.

For some reason I read this as djbdns at first. Naturally, it did not
make any sense.

I hate bind. It has a bad configuration file and zone file format.
It's a resource hog. It has security holes. Blahblah. Whatever djb
says. No, I will not install djbdns. But I would appreciate a decent
free DNS server (it has to actually serve DNS rather than just cache).

[snip]

> So, that means all the more reason for me to package up Aaron's
> webclean-based squid plugin as a Debian package to save all the poor
> freaks out there from Kevin's excesses. It's much better than junkbuster
> because it uses real genuine super-deluxe regular expressions, not just
> shell patterns, and because it allows easy categorization of annoyances
> into HTML, popup, image, or whatever, and deals with them in the
> appropriate way.
> 
> That means I have to become a member of the elite super-secret Debian
> Society, which means I have to learn gnupg, so please send me some
> gnupg-encrypted mail so I can get the hang of it. Public key at:
> http://zgp.org/~dmarti/gnupg (I know it isn't much good until I go to
> the keysigning Saturday.)

Perhaps it would be easier for me to package it right now, as I've
already wedged myself into this elite society. Or if you want to
package it, I could find a way to get it uploaded before you have
endured your lengthly Debian NM torture.

The packaging would be fun because it involves delightful stuff like
trying to modify the squid configuration, which is probably more
difficult than webclean was in the first place. I would package the Don
Marti dataset as the default filtering data :).

Just to waste bandwidth inexcusably, I've attached the program source
and the data/action filepair I use so fellow elitists can comment. I
expanded the commentary in the source a bit.

BTW, my stats on webclean over the past 21 days (19864 requests)
indicate that it can handle ~750 requests per second on my lame box,
which I guess is not so bad for a filter. I wonder how fast it is with
the squid overhead factored in.

And all elitists should be familiar with GPG...
-------------- next part --------------
A non-text attachment was scrubbed...
Name: webclean.c
Type: text/x-csrc
Size: 4911 bytes
Desc: not available
Url : http://allium.zgp.org/pipermail/linux-elitists/attachments/20010213/ba357468/attachment.c 
-------------- next part --------------
comment: Raging Search title
pattern: ^http://www.altavista.com/.*\.gif
type: annoyance/image

comment: SourceForge banner
pattern: ^https?://www.*\.valinux\.com/adserver
type: ad/image

pattern: ^http://www\.playboy\.com/ad-imx/cyberpopups/
type: ad/popup

pattern: ^http://www\.angelfire\.com/sys/popup_source\.shtml
type: ad/popup

comment: superstitial ads are evil and must die.  No, wait, that's all ads.
pattern: ^http://adcontroller\.unicast\.com
type: ad/html

comment: preferences.com "web bug" ^http://www.tiac.net/users/smiths/privacy/wbfaq.htm
pattern: ^http://media\.preferences\.com
type: bug/image

comment: vcall.com needs a bit more cleaning up I would say.
pattern: ^http://(www\.)?vcall\.com/doubleclick/
type: annoyance/html

comment: ad frame
pattern: ^http://(www\.)?ITworld\.com/agentIT
type: annoyance/html

comment: abcheadline flashy thing
pattern: ^http://image\.weather\.com/cobrand/abcnews/weatherheadlines\.gif
type: annoyance/image

pattern: ^http://www\.weather\.com/creatives/twc/.*\.gif
type: ad/image

pattern: ^http://www\.cnn\.com/images/1999/10/cnnstore\.gif
type: annoyance/image

comment: stat counter box with nasty animated graphics
pattern: ^http://stats\.hitbox\.com/buttons/.
type: ad/image

comment: I am not sure if I should merge this with the previous one
pattern: ^http://hg1\.hitbox\.com
type: ad/image

comment: stat counter box with nasty animated graphics
pattern: ^http://aphrodite\.porntrack\.com:80/.
type: ad/image

comment: stat counter box with nasty animated graphics
pattern: ^http://.*\.sextracker\.com/.
type: ad/image

comment: stat counter box with nasty animated graphics
pattern: ^http://c1\.thecounter\.com/.
type: ad/image

comment: stat counter box with nasty animated graphics
pattern: ^http://206\.132\.234\.218/.*\.gif
type: ad/image

comment: LinuxWorld magazine nav frame
pattern: ^http://.+\.itworld\.com/agentIT/itw\.html$
type: ad/html

pattern: ^http://home\.netscape\.com/bookmark/4_7/tshop\.html
type: ad/html

pattern: /homepopup.html$
type: ad/popup

pattern: ^http://.*/adserve/adserve.dll
type: ad/image

pattern: BannerAds/
type: ad/image	

comment: a standard ad size...
pattern: 468x60.*\.gif$
type: ad/image

comment: mpl.net
pattern: ^http://205\.153\.208\.93/.+\.gif
type: ad/image

comment: MatchLogic == pigfuckers!!! 208.176.169.0 - 127
pattern: ^http://208\.178\.169\.[0-9]+.*\.gif$
type: ad/image

comment: doubleclick.net
pattern: ^http://.*doubleclick\.net/(view)?ad/
type: ad/image

comment: ad.doubleclick.net
comment: doubleclick.net 1-letter server names
pattern: ^http://(ad|.)\.doubleclick\.net
type: ad/image

conmment: ln.doubleclick.net  -- don't know what it is, but I don't like it.
pattern: ^http://ln\.doubleclick\.net
type: ad/html

comment: businessweek small ad
pattern: ^http://.*businessweek\.com/sponsors/.*\.gif$
type: ad/image

comment: cbs MarketWatch popup
pattern: ^http://cbs\.marketwatch\.com/.*_popup\.htx
type: ad/popup

# comment: gifs must die.
# pattern: ^http://images.slashdot.org/.*\.gif$
# type: annoyance/image


comment: tripod popup 20 November 1999
pattern: ^http://.*tripod\.com/adm/popup
type: ad/popup

comment: netscape popup 17 November 1999
pattern: ^http://.*/netscape\.com/misc/snf/.*\.html$
type: ad/popup

comment: adfu this...
pattern: ^http://.*/servfu.pl
type: ad/image

pattern: ^http://images\.zdnet\.com/adverts
type: ad/image

pattern: ^http://.*\.yimg\.com/.*\.gif$
type: ad/image

pattern: ^http://([^\.]+\.)*burstnet\.com/cgi-bin/ads
type: ad/image

pattern: ^http://.*/RealMedia/ads/
type: ad/image

pattern: ^http://www\.geocities\.com/toto
type: ad/popup

pattern: ^http://macprovider\.com/images/.*banner
type: ad/image

comment: Linux Today ads
pattern: ^http://linuxtoday\.com/.*\.gif
type: ad/image

comment: linux.org
pattern: ^http://[^/]*linux\.org/graphic/square/
type: ad/image

comment: AdForce
pattern: ^http://.*(imageserv[0-9]*|cache)\.imgis\.com
type: ad/image

comment: directory "ads", "adimages",
pattern: ^http://.*/ad(image)?s/
type: ad/image

comment: server named "ad" "ads" or "adserver" optionally followed by a number
pattern: ^http://ads?(server)?[0-9]*\.
type: ad/image

comment: /cgi-bin/ad/ can't be nice, can it?
pattern: ^http://[^/]+/cgi-bin/ads?/
type: ad/image

pattern: ^http://205\.153\.208\.93/.+\.gif
type: ad/image

pattern: ^http://www\.usatoday\.com/sponsors/.
type: ad/image

pattern: ^http://www\.bannerfx\.com/cgi_bin/.
type: ad/image

pattern: ^http://.+\.miningco\.com/zadz/.
type: ad/image

pattern: ^http://12\.16\.1\.10/.*\.gif$
type: ad/image

pattern: ^http://offers\.egroups\.com/images/
type: ad/image

comment: real.com popup
pattern: ^http://([^\.]\.)*real\.com/.*_pop\.html
type: ad/popup

comment: match a lot of ads.
pattern: ^http://([^/]+/)*[Aa]d(v(ert(is(ments?|ing)?)?)?)?(graph(ic)?)?(gif)?s?[0-9]*[\./].+\.(gif|jpe?g)
type: ad/image

comment: smartbanner servers
pattern: ^http://([^/]+/)*(Smart)?[Bb]ann?e?r(gif)?s?[0-9]*\.?/.+
type: ad/image

comment: hosts named "adfinity" something
pattern: ^http://adfinity[0-9]*\.
type: ad/image

comment: need to check this one
pattern: ^http://([^/]+/)liveads/
type: ad/image

pattern: ^http://[^/]*247media\.com
type: ad/image

pattern: ^http://[^/]*.ngadcenter\.net/.
type: ad/image

pattern: ^http://[^/]+/[^\.]+\.ng/.+=.
type: ad/image

pattern: ^http://home\.netscape\.com/inserts/images
type: ad/image

pattern: ^http://199\.172\.144\.25/.
type: ad/image

pattern: ^http://[^/]+\.flycast\.com/.
type: ad/image

pattern: ^http://crux\.songline\.com:8080/.
type: ad/image

pattern: ^http://image\.avenuea\.com/.
type: ad/image

pattern: ^http://www\.goto\.com/.+\.gif$
type: ad/image

pattern: ^http://cds\.mediaplex\.com/.
type: ad/image

pattern: ^http://gp\.dejanews\.com/.
type: ad/image

pattern: ^http://207\.54\.137\.73/img/.
type: ad/image

pattern: ^http://www\.ad-up\.com/cgi-bin/view.cgi
type: ad/image

pattern: ^http://www\.PostMasterBannerNet\.com/.+\.gif$
type: ad/image

pattern: ^http://image\.linkexchange\.com
type: ad/image

pattern: ^http://adimage\.blm\.net/.
type: ad/image

comment: suck.com ad frames -- now suck.lycos.com...
pattern: ^http://([^\.]+\.)?suck\.(lycos\.)?com/sponsors/.*\.html$
type: ad/html

comment: theonion.com ad frames
pattern: ^http://(.+\.)?theonion\.com/adframes/.
type: ad/html

comment: theonion.com ads
pattern: ^http://.+\.theonion\.com/ad_graphics/.
type: ad/image

comment: "come.to" realize popups suck!
pattern: ^http://come\.to/pop\.asp
type: ad/popup
-------------- next part --------------
type: .*/image
replacement: http://localhost/webclean/dot_clear.png

type: .*/popup
replacement: http://localhost/webclean/popdown.html

type: .*/html
replacement: http://localhost/webclean/black.html

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://allium.zgp.org/pipermail/linux-elitists/attachments/20010213/ba357468/attachment.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dot_clear.png
Type: image/png
Size: 261 bytes
Desc: not available
Url : http://allium.zgp.org/pipermail/linux-elitists/attachments/20010213/ba357468/attachment.png 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://allium.zgp.org/pipermail/linux-elitists/attachments/20010213/ba357468/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 232 bytes
Desc: not available
Url : http://allium.zgp.org/pipermail/linux-elitists/attachments/20010213/ba357468/attachment.pgp 


More information about the linux-elitists mailing list