For example, the importer will give you an option to remove a repeated phrase from page titles. It's a good idea to look through your HTML files and make sure things are consistent before you run the importer. Otherwise, you’ll flood your social network with your imported posts! Cleaning Things Up If you’ve installed a plugin that crossposts your content to another site (like Facebook or LiveJournal) or automatically notifies another site of your new posts (like Twitter), be sure to deactivate those plugins before you import. If you’re importing content into a WordPress site that already contains content, back up your database and put the site into maintenance mode before you begin importing.
SITESUCKER WORDPRESS DATABASE INSTALL
(Despite the name, Mass Page Remover works just as well on posts.) You’ll probably need to do a little bit of cleanup work afterward, so you’ll need to install the Search and Replace plugin as well. Therefore, it’s important to install the DB Backup and Mass Page Remover plugins in case you need to start over. Importing can be tricky, and it doesn’t always go well the first time. The importer used the index file to represent the folder, then imported the other file in that directory as a child page: For example, on this site, there were two files in a "keynotes" directory: If the directory contained a default index file (in this case, index.html), the contents of that file will be used for the parent page. If you choose to import the files as pages (or any hierarchical post type), the new page hierarchy will match your original directory hierarchy.
(I've done this a few times for clients who'd forgotten their passwords.) The importer works great on the downloaded files.
SITESUCKER WORDPRESS DATABASE DOWNLOAD
If you do not have (S)FTP access to the old site, you can use an application like SiteSucker to download the files. The files you are importing must be on the same server as your WordPress installation. The importer will try to import improperly nested HTML, but it might not work as you expect! Your files don’t necessarily have to validate according to the W3C specification, but they should at least contain tags that are properly nested. It therefore works best with well-formed HTML. The plugin works by reading HTML as XML and copying the specified tags’ contents into various WordPress fields. Updates coming in a few days! - December 7 How This Works I haven't updated this to reflect the UI changes and new features in 2.3 yet. HTML Import 2.0 User Guide HTML Import 2.0 User Guide