Re: [SLUG] Web Page Storage

From: steve szmidt (steve@szmidt.org)
Date: Thu May 11 2006 - 16:08:59 EDT


On Thursday 11 May 2006 15:51, S0TL wrote:
> On Thursday 11 May 2006 12:01 pm, Kwan Lowe wrote:
> > > So my question is how can one save what is open on a website say as a
> > > MS Office .doc or OpenOffice OO file which is searchable, has the same
> > > information, and does not take 10 minutes or so per page to do? [I am
> > > assume of course that the website is something like HTML not something
> > > like Adobe Acrobat.]
> >
> > If you're mainly concerned about the text of the document I'd suggest
> > using wget or curl to pull the web page, then process the html to create
> > a text document using links. I.e., wget URL; links -dump file.html >
> > file.txt
>
> Is there a printer in Linux like ther is in Windows that will print to
> file?
>
CUPS does that standardly.

-- 

Steve Szmidt

"To enjoy the right of political self-government, men must be capable of personal self-government - the virtue of self-control. A people without decency cannot be secure in its liberty. From the Declaration Principles ----------------------------------------------------------------------- This list is provided as an unmoderated internet service by Networked Knowledge Systems (NKS). Views and opinions expressed in messages posted are those of the author and do not necessarily reflect the official policy or position of NKS or any of its employees.



This archive was generated by hypermail 2.1.3 : Fri Aug 01 2014 - 18:51:30 EDT