8. Actions Files

Confuse log analysis, custom applications

Sends a user defined HTTP header to the web server.

Multi-value.

Any string value is possible. Validity of the defined HTTP headers is not checked. It is recommended that you use the "X-" prefix for custom headers.

This action may be specified multiple times, in order to define multiple headers. This is rarely needed for the typical user. If you don't know what "HTTP headers" are, you definitely don't need to worry about this one.

+add-header{X-User-Tracking: sucks}

8.5.2. block

Block ads or other obnoxious content

Requests for URLs to which this action applies are blocked, i.e. the requests are not forwarded to the remote server, but answered locally with a substitute page or image, as determined by the handle-as-image and set-image-blocker actions.

Type:

Boolean.

Parameter:

N/A

Notes:

Privoxy sends a special "BLOCKED" page for requests to blocked pages. This page contains links to find out why the request was blocked, and a click-through to the blocked content (the latter only if compiled with the force feature enabled). The "BLOCKED" page adapts to the available screen space -- it displays full-blown if space allows, or miniaturized and text-only if loaded into a small frame or window. If you are using Privoxy right now, you can take a look at the "BLOCKED" page.

A very important exception occurs if both block and handle-as-image, apply to the same request: it will then be replaced by an image. If set-image-blocker (see below) also applies, the type of image will be determined by its parameter, if not, the standard checkerboard pattern is sent.

It is important to understand this process, in order to understand how Privoxy deals with ads and other unwanted content.

The filter action can perform a very similar task, by "blocking" banner images and other content through rewriting the relevant URLs in the document's HTML source, so they don't get requested in the first place. Note that this is a totally different technique, and it's easy to confuse the two.

Example usage (section):

{+block} # Block and replace with "blocked" page .nasty-stuff.example.com {+block +handle-as-image} # Block and replace with image .ad.doubleclick.net .ads.r.us

8.5.3. crunch-incoming-cookies

Prevent the web server from setting any cookies on your system

Deletes any "Set-Cookie:" HTTP headers from server replies.

This action is only concerned with incoming cookies. For outgoing cookies, use crunch-outgoing-cookies. Use both to disable cookies completely.

It makes no sense at all to use this action in conjunction with the session-cookies-only action, since it would prevent the session cookies from being set.

Example usage:

+crunch-incoming-cookies

8.5.4. crunch-outgoing-cookies

Prevent the web server from reading any cookies from your system

Deletes any "Cookie:" HTTP headers from client requests.

This action is only concerned with outgoing cookies. For incoming cookies, use crunch-incoming-cookies. Use both to disable cookies completely.

It makes no sense at all to use this action in conjunction with the session-cookies-only action, since it would prevent the session cookies from being read.

Example usage:

+crunch-outgoing-cookies

The name of a filter, as defined in the filter file (typically default.filter, set by the filterfile option in the config file). Filtering can be completely disabled without the use of parameters.

Notes:

For your convenience, there are a number of pre-defined filters available in the distribution filter file that you can use. See the examples below for a list.

This is potentially a very powerful feature! But "rolling your own" filters requires a knowledge of regular expressions and HTML.

Filtering requires buffering the page content, which may appear to slow down page rendering since nothing is displayed until all content has passed the filters. (It does not really take longer, but seems that way since the page is not incrementally displayed.) This effect will be more noticeable on slower connections.

The amount of data that can be filtered is limited to the buffer-limit option in the main config file. The default is 4096 KB (4 Megs). Once this limit is exceeded, the buffered data, and all pending data, is passed through unfiltered. Inappropriate MIME types are not filtered.

At this time, Privoxy cannot (yet!) uncompress compressed documents. If you want filtering to work on all documents, even those that would normally be sent compressed, use the prevent-compression action in conjunction with filter.

Filtering can achieve some of the same effects as the block action, i.e. it can be used to block ads and banners. But the mechanism works quite differently. One effective use, is to block ad banners based on their size (see below), since many of these seem to be somewhat standardized.

Feedback with suggestions for new or improved filters is particularly welcome!

Example usage (with filters from the distribution default.filter file):

+filter{html-annoyances} # Get rid of particularly annoying HTML abuse.

+filter{js-annoyances} # Get rid of particularly annoying JavaScript abuse

+filter{banners-by-size} # Kill banners based on their size for this page (very efficient!)

+filter{banners-by-link} # Kill banners based on the link they are contained in (experimental)

+filter{img-reorder} # Reorder attributes in <img> tags to make the banners-by-* filters more effective

+filter{content-cookies} # Kill cookies that come sneaking in the HTML or JS content

+filter{popups} # Kill all popups in JS and HTML

+filter{webbugs} # Squish WebBugs (1x1 invisible GIFs used for user tracking)

+filter{fun} # Text replacements for subversive browsing fun!

+filter{frameset-borders} # Give frames a border and make them resizeable

+filter{refresh-tags} # Kill automatic refresh tags (for dial-on-demand setups)

+filter{nimda} # Remove Nimda (virus) code.

+filter{shockwave-flash} # Kill embedded Shockwave Flash objects

+filter{crude-parental} # Kill all web pages that contain the words "sex" or "warez"

+filter{js-events} # Kill all JS event bindings (Radically destructive! Only for extra nasty sites)

8.5.9. handle-as-image

Mark URLs as belonging to images (so they'll be replaced by images if they get blocked)

This action alone doesn't do anything noticeable. It just marks URLs as images. If the block action also applies, the presence or absence of this mark decides whether an HTML "blocked" page, or a replacement image (as determined by the set-image-blocker action) will be sent to the client as a substitute for the blocked content.

Type:

Boolean.

Parameter:

N/A

Notes:

The below generic example section is actually part of default.action. It marks all URLs with well-known image file name extensions as images and should be left intact.

Users will probably only want to use the handle-as-image action in conjunction with block, to block sources of banners, whose URLs don't reflect the file type, like in the second example section.

Note that you cannot treat HTML pages as images in most cases. For instance, (in-line) ad frames require an HTML page to be sent, or they won't display properly. Forcing handle-as-image in this situation will not replace the ad frame with an image, but lead to error messages.

Example usage (sections):

# Generic image extensions: # {+handle-as-image} /.*\.(gif|jpg|jpeg|png|bmp|ico)$ # These don't look like images, but they're banners and should be # blocked as images: # {+block +handle-as-image} some.nasty-banner-server.com/junk.cgi?output=trash # Banner source! Who cares if they also have non-image content? ad.doubleclick.net

8.5.10. hide-forwarded-for-headers

Improve privacy by hiding the true source of the request

Deletes any existing "X-Forwarded-for:" HTTP header from client requests, and prevents adding a new one.

It is fairly safe to leave this on.

This action is scheduled for improvement: It should be able to generate forged "X-Forwarded-for:" headers using random IP addresses from a specified network, to make successive requests from the same client look like requests from a pool of different users sharing the same proxy.

+hide-forwarded-for-headers

8.5.11. hide-from-header

Keep your (old and ill) browser from telling web servers your email address

Deletes any existing "From:" HTTP header, or replaces it with the specified string.

Parameterized.

Keyword: "block", or any user defined value.

The keyword "block" will completely remove the header (not to be confused with the block action).

Alternately, you can specify any value you prefer to be sent to the web server. If you do, it is a matter of fairness not to use any address that is actually used by a real person.

This action is rarely needed, as modern web browsers don't send "From:" headers anymore.

Example usage:

+hide-from-header{block}
or
+hide-from-header{spam-me-senseless@sittingduck.example.com}

8.5.12. hide-referrer

Typical use:

Conceal which link you followed to get to a particular site

Effect:

Deletes the "Referer:" (sic) HTTP header from the client request, or replaces it with a forged one.

Type:

Parameterized.

Parameter:

"block" to delete the header completely.
"forge" to pretend to be coming from the homepage of the server we are talking to.
Any other string to set a user defined referrer.

Notes:

"forge" is the preferred option here, since some servers will not send images back otherwise, in an attempt to prevent their valuable content from being embedded elsewhere (and hence, without being surrounded by their banners).

hide-referer is an alternate spelling of hide-referrer and the two can be can be freely substituted with each other. ("referrer" is the correct English spelling, however the HTTP specification has a bug - it requires it to be spelled as "referer".)

Example usage:

+hide-referrer{forge}
or
+hide-referrer{http://www.yahoo.com/}

8.5.13. hide-user-agent

Conceal your type of browser and client operating system

Replaces the value of the "User-Agent:" HTTP header in client requests with the specified value.

Parameterized.

Any user-defined string.

Using this action in multi-user setups or wherever different types of browsers will access the same Privoxy is not recommended. In single-user, single-browser setups, you might use it to delete your OS version information from the headers, because it is an invitation to exploit known bugs for your OS. It is also occasionally useful to forge this in order to access sites that won't let you in otherwise (though there may be a good reason in some cases). Example of this: some MSN sites will not let Mozilla enter, yet forging to a Netscape 6.1 user-agent works just fine. (Must be just a silly MS goof, I'm sure :-).

Warning

This breaks many web sites that depend on looking at this header in order to customize their content for different browsers (which, by the way, is NOT a smart way to do that!).

This action is scheduled for improvement.

+hide-user-agent{Netscape 6.1 (X11; I; Linux 2.4.18 i686)}

8.5.14. kill-popups

Typical use:

Eliminate those annoying pop-up windows

Effect:

While loading the document, replace JavaScript code that opens pop-up windows with (syntactically neutral) dummy code on the fly.

Type:

Boolean.

Parameter:

N/A

Notes:

This action is easily confused with the built-in, hardwired filter action, but there are important differences: For kill-popups, the document need not be buffered, so it can be incrementally rendered while downloading. But kill-popups doesn't catch as many pop-ups as filter{popups} does.

Think of it as a fast and efficient replacement for a filter that you can use if you don't want any filtering at all. Note that it doesn't make sense to combine it with any filter action, since as soon as one filter applies, the whole document needs to be buffered anyway, which destroys the advantage of the kill-popups action over its filter equivalent.

Killing all pop-ups is a dangerous business. Many shops and banks rely on pop-ups to display forms, shopping carts etc, and killing only the unwanted pop-ups would require artificial intelligence in Privoxy. If the only kind of pop-ups that you want to kill are exit consoles (those really nasty windows that appear when you close an other one), you might want to use filter{js-annoyances} instead.

Example usage:

+kill-popups

8.5.15. limit-connect

Prevent abuse of Privoxy as a TCP proxy relay

Specifies to which ports HTTP CONNECT requests are allowable.

Parameterized.

A comma-separated list of ports or port ranges (the latter using dashes, with the minimum defaulting to 0 and the maximum to 65K).

By default, i.e. if no limit-connect action applies, Privoxy only allows HTTP CONNECT requests to port 443 (the standard, secure HTTPS port). Use limit-connect if more fine-grained control is desired for some or all destinations.

The CONNECT methods exists in HTTP to allow access to secure websites ("https://" URLs) through proxies. It works very simply: the proxy connects to the server on the specified port, and then short-circuits its connections to the client and to the remote server. This can be a big security hole, since CONNECT-enabled proxies can be abused as TCP relays very easily.

If you don't know what any of this means, there probably is no reason to change this one, since the default is already very restrictive.

Example usages:

+limit-connect{443} # This is the default and need not be specified. +limit-connect{80,443} # Ports 80 and 443 are OK. +limit-connect{-3, 7, 20-100, 500-} # Ports less than 3, 7, 20 to 100 and above 500 are OK. +limit-connect{-} # All ports are OK (gaping security hole!)

8.5.16. prevent-compression

Ensure that servers send the content uncompressed, so it can be passed through filters

Effect:

Adds a header to the request that asks for uncompressed transfer.

Type:

Boolean.

Parameter:

N/A

Notes:

More and more websites send their content compressed by default, which is generally a good idea and saves bandwidth. But for the filter, deanimate-gifs and kill-popups actions to work, Privoxy needs access to the uncompressed data. Unfortunately, Privoxy can't yet(!) uncompress, filter, and re-compress the content on the fly. So if you want to ensure that all websites, including those that normally compress, can be filtered, you need to use this action.

This will slow down transfers from those websites, though. If you use any of the above-mentioned actions, you will typically want to use prevent-compression in conjunction with them.

Note that some (rare) ill-configured sites don't handle requests for uncompressed documents correctly (they send an empty document body). If you use prevent-compression per default, you'll have to add exceptions for those sites. See the example for how to do that.

Example usage (sections):

# Set default: # {+prevent-compression} / # Match all sites # Make exceptions for ill sites: # {-prevent-compression} www.debianhelp.org www.pclinuxonline.com

8.5.17. send-vanilla-wafer

Feed log analysis scripts with useless data.

Sends a cookie with each request stating that you do not accept any copyright on cookies sent to you, and asking the site operator not to track you.

The vanilla wafer is a (relatively) unique header and could conceivably be used to track you.

This action is rarely used and not enabled in the default configuration.

+send-vanilla-wafer

8.5.18. send-wafer

Send custom cookies or feed log analysis scripts with even more useless data.

Sends a custom, user-defined cookie with each request.

Multi-value.

A string of the form "name=value".

Being multi-valued, multiple instances of this action can apply to the same request, resulting in multiple cookies being sent.

This action is rarely used and not enabled in the default configuration.

Example usage (section):

{+send-wafer{UsingPrivoxy=true}} my-internal-testing-server.void

8.5.19. session-cookies-only

Allow only temporary "session" cookies (for the current browser session only).

Deletes the "expires" field from "Set-Cookie:" server headers. Most browsers will not store such cookies permanently and forget them in between sessions.

This is less strict than crunch-incoming-cookies / crunch-outgoing-cookies and allows you to browse websites that insist or rely on setting cookies, without compromising your privacy too badly.

Most browsers will not permanently store cookies that have been processed by session-cookies-only and will forget about them between sessions. This makes profiling cookies useless, but won't break sites which require cookies so that you can log in for transactions. This is generally turned on for all sites, and is the recommended setting.

It makes no sense at all to use session-cookies-only together with crunch-incoming-cookies or crunch-outgoing-cookies. If you do, cookies will be plainly killed.

Note that it is up to the browser how it handles such cookies without an "expires" field. If you use an exotic browser, you might want to try it out to be sure.

Example usage:

+session-cookies-only

8.5.20. set-image-blocker

Choose the replacement for blocked images