You take care of a somewhat large website for a certain organization.
Or leave to work for another organization.
You decide to scrape a wiki containing a lot of organization info as part of a new data administration style. So now to get the externally hosted wiki into your site, and make it look good. However, the SOAP API won’t be available to you for at least a year. So you think PHP, libcurl with regexps, and you become a a bot writer. But Mac Server comes with a decapitated verion of php. So install a new version of php.
Or not.
But the links are all off. So you decide on iFrames and Javascript. Except you are still gonna have to curl the bloody thing onto your server anyways because JS won’t work on iFrames that aren’t on your domain. Except when it will, using even more JS.
Or spend all your time on reddit.
So you start to brush up on Javascript. Apparently everyone writes horrendous Javascript. Except for the 4 libraries that seem right. Not to mention the one you have on the site already, which might not play together well with others.
Or .
You haven’t started looking at the CSS yet, which it is pretty sure won’t display correctly on Internet Explorer 6, at least not without quite a bit of hair pulling, or even more JS.
Or not.
Lo and behold you do have access to the XMLRPC API! But only with the Java CLI. Relearn Java. No wait, you can Applescript it. Learn Applescript. Do I want to? No, wait you can Python it. Learn Python. No wait it isn’t accepting your ‘None’ for Java’s ‘Null’. Learn Jython. No, wait, it is.
Or not.
Using Python and the XMLRPC API you can get the necessary pages and put them on your server. But wait, change the pages from ISO-3000 to UTF-8, we got accents here! Also, your links are off. But you are using Apache, so learn about mod_rewrite. It works! Scrape the whole thing!
Or not.