« A Close Reading | Main | Speaking News »

Comments

Kenneth Rufo

I've never used Spotlight, so I don't know how restrictive the text file requirement is, but if it's relatively loose, here's a suggestion. In the MT installation, add a second individual entry archive, and set the archive file template to something like:

/.txt

[Or .html, if Spotlight has problems stripping the xhtml markups when looking at the files.]

Then rebuild the site, and you should have a number of new mt/subdirs based on the number of extant categories (I know you're not interested in categories, but it would easily solve the problem of having any duplicate filenames, and of course, you can kill the categories dir if you don't like it), and then just download those folders to the HD. It might work - like I said, no Spotlight experience, but it should come close to what you want.

If Spotlight can't read them, at least doing the .html as the archive file template will at least give you name files rather than numbers, which should help when you export the entries.

Kenneth Rufo

Oops, it cut the html/MT tags out of the comment. The /.txt line should read:

$MTEntryCategory dirify="1"$>/$MTEntryTitle dirify="1"$.txt

And the ".html" reference in the last paragraph should read "$MTEntryTitle dirify="1"$.html.

Abe

There is a built in export function in Movable Type. It creates a large text file with all your entries. The html tags are left in, but I find them relatively unobtrusive. It's one of the tabs on the left, labeled import/export.

Chris Janton

The Safari cache files are not plain text or plain images. They are binary files.

You can search the Safari history (titles, URLs, partial URLs) from the Bookmarks window. The search box at the bottom has an option to search the history.

Not as good as full-text on the page contents, but...

DonnaM

Another way of getting the content out:

Create a new index template (copy it from your main index template so you have all the html headers etc) and replace the body with this:


Give the output file a name like "all_content.html". Save the file and rebuild the index. The page will be http://www.stevenberlinjohnson.com/movabletype/all_content.html

It will give you an enormous web page with the title, URL, date and content of all your entries.

nickb

Building on Donna's comment:
Make a master index file (a default is included in the distribution.) Pull all the extraneous information out that you don't want in the final file.

Rebuild to create the file.

Run through Lynx to strip the HTML out. (IMO, Lynx is the best HTML to ASCII interpreter out there, no graphics for it.) From my previous work I prefer the following options to convert HTML to text: -dump -force_html -image_links -pseudo_inlines -dont_wrap_pre -image_links -underscore -width=78

Those are actually the options I use to convert my blog into individual emails. (See http://www.inmff.net/peidm/mailer.txt for the source before it is de-html'd.)

jeffeng

Don't forget, you don't have to strain to remember your exact Google searches. Safari keeps past searches as a pulldown menu in the built-in Google search box at the top of the browser window.

Doug

You are right, the ability to use Spotlight to search your browser cache would be terrific. Until Apple extends Spotlight, you might look at History Hound, a Mac OS X utility that is designed to do just that.

http://www.stclairsoft.com/HistoryHound/index.html

Jim

If you find out how to do it please post it I would love to also move all my moveable tpy einto a better folder.

The comments to this entry are closed.

My Photo

SBJ via Twitter

    follow me on Twitter

    The Basics

    • I'm a father of three boys, husband of one wife, and author of seven books, and co-founder of three web sites. We spend most of the year in Marin County, California though I'm on the road a lot giving talks. (You can see the full story here.) Personal correspondence should go to sbeej at earthlink dot net. Media requests should go to Matthew.Venzon at us.penguingroup dot com. If you're interested in having me speak at an event, drop a line to Wesley Neff at the Leigh Bureau (WesN at Leighbureau dot com.)

    My Books

    • : Where Good Ideas Come From: The Natural History of Innovation

      Where Good Ideas Come From: The Natural History of Innovation
      An exploration of environments that lead to breakthrough innovation, in science, technology, business, and the arts. I conceived it as the closing book in a trilogy on innovative thinking, after Ghost Map and Invention. But in a way, it completes an investigation that runs through all the books. Sold more copies in hardcover than anything else I've written.

    • : The Invention of Air

      The Invention of Air
      The story of the British radical chemist Joseph Priestley, who ended up having a Zelig-like role in the American Revolution. My version of a founding fathers book, and a reminder that most of the Enlightenment was driven by open source ideals.

    • : The Ghost Map

      The Ghost Map
      The latest: the story of a terrifying outbreak of cholera in 1854 London 1854 that ended up changing the world. An idea book wrapped around a page-turner. I like to think of it as a sequel to Emergence if Emergence had been a disease thriller. You can see a trailer for the book here.

    • : Everything Bad Is Good for You: How Today's Popular Culture Is Actually Making Us Smarter

      Everything Bad Is Good for You: How Today's Popular Culture Is Actually Making Us Smarter
      The title says it all. This one sparked a slightly insane international conversation about the state of pop culture -- and particularly games. There were more than a few dissenters, but the response was more positive than I had expected. And it got me on The Daily Show, which made it all worthwhile.

    • : Mind Wide Open : Your Brain and the Neuroscience of Everyday Life

      Mind Wide Open : Your Brain and the Neuroscience of Everyday Life
      My first best-seller, and the only book I've written in which I appear as a recurring character, subjecting myself to a battery of humiliating brain scans. The last chapter on Freud and the neuroscientific model of the mind is one of my personal favorites.

    • : Emergence: The Connected Lives of Ants, Brains, Cities, and Software

      Emergence: The Connected Lives of Ants, Brains, Cities, and Software
      The story of bottom-up intelligence, from slime mold to Slashdot. Probably the most critically well-received all my books, and the one that has influenced the most eclectic mix of fields: political campaigns, web business models, urban planning, the war on terror.

    • : Interface Culture : How New Technology Transforms the Way We Create and Communicate

      Interface Culture : How New Technology Transforms the Way We Create and Communicate
      My first. The book I wrote instead of finishing my dissertation. Still in print almost a decade later, and still relevant, I think. But I haven't read it in a while, so who knows what's in there!

    Blog powered by TypePad