LayoutURL
LayoutURL is similar to URL, but defines a layout for a specific URL pattern.
If a page's URL falls within this pattern, any parameter that is defined for
the layout but not by the site file takes its value from the layout.
This is an easy way to specify default values that several sites can share:
just define them in a layouts file such as lib/layouts.site and
they will be inherited.
LevelNPrint / IssuePrint / ContentsPrint
Specify whether a page at the given links level should be printed, i.e.
included in the output. The default is 0 for text-style output, or 1 for
HTML-style output. (HTML-style here means any output format that supports
hyperlinks, including iSilo etc.)
There is no StoryPrint, as stories are always printed.
StoryHeadline
This specifies a regular expression pattern used to search for the
story's headline or title. This is primarily useful for DOC-format
output, where a bookmark is created at the start of each story
using the headline as a bookmark title.
The story HTML is searched for this pattern before StoryStart
and StoryEnd stripping takes place.
It should be specified as a regular expression containing a single
(pattern) subexpression; the text matched by the part of the pattern
between the parentheses is used as the headline text.
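
As a rough illustration of the single-subexpression rule, the following Perl sketch applies a candidate headline pattern to some story HTML. The HTML, the pattern, and the extract_headline helper are all made up for this example, and the /i and /s flags are choices made here; this is not sitescooper's own matching code.

```perl
use strict;
use warnings;

# Hypothetical story HTML, as it might look before any
# StoryStart/StoryEnd stripping has taken place.
my $story_html = <<'HTML';
<html><head><title>Example News</title></head>
<body>
<div class="nav">Home | Archive</div>
<h1 class="headline">Example Story Title</h1>
<p>Story text goes here.</p>
</body></html>
HTML

# Candidate StoryHeadline value: exactly one (pattern) subexpression.
my $headline_pattern = '<h1 class="headline">(.*?)</h1>';

# Hypothetical helper: returns the text matched by the subexpression,
# or undef if the pattern does not match at all.
sub extract_headline {
    my ($html, $pattern) = @_;
    return $html =~ /$pattern/is ? $1 : undef;
}

my $headline = extract_headline($story_html, $headline_pattern);
print defined $headline ? "Headline: $headline\n" : "No headline found\n";
```

Trying a pattern this way against a saved copy of a story page is a quick check that the parentheses enclose only the title text and none of the surrounding markup.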
StoryToPrintableSub
A Perl regular expression substitution used to convert story links to a form
more suitable for sitescooper output. For example, many sites provide multiple
views of a story, including a "printable" view for printing, and often the
"printable" view is more amenable to scooping than the non-printable version.
StoryToPrintableSub allows you to convert the story URLs to this
"printable" format.
The StoryURL pattern must match the "printable" version.
It does not need to match the original, "non-printable" format.
The format of a Perl substitution is as follows:
s,from-pattern,replacement, where from-pattern is a
Perl regexp pattern, generally containing (pattern)
subexpressions, and replacement is the replacement text, containing
\number markers where the strings matched by the parenthesized
subexpressions are inserted.
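
The sketch below shows such a substitution at work in plain Perl. The example.com URL scheme, the to_printable helper and the StoryURL-style pattern are assumptions invented for illustration; how sitescooper itself applies the parameter is not shown here.

```perl
use strict;
use warnings;

# Hypothetical helper applying a StoryToPrintableSub-style substitution:
#   s,/story/(\d+)\.html,/print/\1.html,
sub to_printable {
    my ($url) = @_;
    no warnings 'syntax';    # Perl would otherwise suggest $1 in place of \1
    $url =~ s,/story/(\d+)\.html,/print/\1.html,;
    return $url;
}

# Hypothetical StoryURL-style pattern; it matches only the printable form.
my $story_url_pattern = 'http://www\.example\.com/print/\d+\.html';

my $original  = 'http://www.example.com/story/1234.html';
my $printable = to_printable($original);

print "$original -> $printable\n";
print "printable form matches the StoryURL pattern\n"
    if $printable =~ /\A(?:$story_url_pattern)\z/;
print "original form does not match the StoryURL pattern\n"
    unless $original =~ /\A(?:$story_url_pattern)\z/;
```

The (\d+) subexpression captures the story number, and \1 re-inserts it into the rewritten path.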
See the FAQ entry
on multi-page stories in the Writing a .site File document
for more information.
ImageScaleToMaxWidth
Specify the maximum width of an image. By default, this is 300, the rough width
in pixels of the Palm handheld's screen; sites with large images, such as
comics, can specify a larger value, which requires the user to scroll around
the image but generally improves the readability of the picture.
This is not an ideal way to solve the problem, however, so this parameter
may be removed or changed in the future.
ImageProcess
A chunk of Perl code which will be used to transform every image that
sitescooper downloads.
The filename of the image downloaded from the website is passed in as
$img_in, and the processed image should be written to the file named in
$img_out. Set $img_out to the undef value if you want to
skip that image.
This parameter is intended to allow the use of image rotation, resizing or
quantizing code. For these purposes, the PerlMagick
module may prove very useful.
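
A minimal sketch of the kind of code this parameter might hold, wrapped in a hypothetical process_image function so that it can be run on its own. The 200-byte threshold, the rotation angle, the 300-pixel width and the 16-colour palette are arbitrary example values. The sketch assumes PerlMagick (Image::Magick) is installed, and loads it only when an image actually needs processing.

```perl
use strict;
use warnings;

# Hypothetical wrapper around an ImageProcess-style body. It returns the
# output filename on success, or undef to indicate "skip this image".
sub process_image {
    my ($img_in, $img_out) = @_;

    # Skip missing or very small files such as spacer GIFs
    # (the 200-byte threshold is arbitrary).
    my $size = -s $img_in;
    return unless defined $size && $size >= 200;

    # Load PerlMagick only when there is real work to do.
    require Image::Magick;
    my $image = Image::Magick->new;

    # PerlMagick methods return an empty value on success.
    my $err = $image->Read($img_in);
    return if "$err";                      # unreadable image: skip it

    $image->Rotate(degrees => 90);         # turn the image a quarter turn
    $image->Resize(geometry => '300x');    # scale to 300 pixels wide
    $image->Quantize(colors => 16);        # reduce the palette

    $err = $image->Write($img_out);
    return "$err" ? undef : $img_out;
}
```

In a real ImageProcess body there would be no wrapper function: the code would read $img_in directly and assign undef to $img_out wherever this sketch returns nothing.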
URLProcess
A chunk of Perl code which will be used to transform every URL that sitescooper
needs to download. This allows a huge degree of control over the links that
sitescooper operates on.
The URL to operate on is passed in as $_, and the post-processed URL is
expected to be in $_ afterwards. Set $_ to the undef
value if you want to skip that URL.
Note that links which do not match the StoryURL and similar patterns are
dropped before URLProcess takes effect, so make sure those patterns are
permissive enough to match the links as they appear before URLProcess
rewrites them.
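
The following sketch shows the $_ convention in plain Perl. The url_process wrapper, the example.com URLs, the /ads/ rule and the jsessionid rule are all hypothetical; in a site file only the body operating on $_ would be supplied.

```perl
use strict;
use warnings;

# Hypothetical wrapper around a URLProcess-style body: the URL arrives
# in $_ and the result (or undef, meaning "skip") is left in $_.
sub url_process {
    my ($url) = @_;
    local $_ = $url;

    if (m{/ads/}) {
        $_ = undef;                  # skip advertising links entirely
    } else {
        s/;jsessionid=[^?#]*//i;     # drop a per-visit session id
    }
    return $_;
}

for my $url ('http://www.example.com/story.jsp;jsessionid=ABC123?id=7',
             'http://www.example.com/ads/banner.html') {
    my $result = url_process($url);
    print "$url -> ", (defined $result ? $result : '(skipped)'), "\n";
}
```

Stripping a session identifier like this can stop the same story from being treated as a new URL on every run, which is one plausible use of the parameter.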