The Internet Archive obeys robots.txt of course (lucky for you if you have access to it on your site, otherwise not so much) and they will also agree to remove things at the domain owners request. Other libraries might not be so accommodating, specifically the Danish netarchive might not be so accommodating, lets look at some stuff they say - the following is from the already linked survey report:
Results tagged “internet archive” from O'Reilly News
Popular Topics
| Actionscript | Ajax |
| Apache | C# |
| CSS | Flex |
| Head First | iPhone |
| Java | JavaScript |
| Linux | Missing Manuals |
| MySQL | Open Source |
| Perl | PHP |
| Photoshop | Python |
| Ruby | Web 2.0 |
| XML |
Browse Books
News Topics
- balisage conference 08
- government
- javascript
- linux
- open source
- oscon
- perl
- python
- web 2.0
- xml
- xslt
- adobe
- app engine
- apple
- balisage conference 08
- cloud computing
- community
- conference
- congress
- creative commons
- databases
- documentation
- dojo
- economy
- energy
- free software
- government
- green computing
- iphone
- it
- java
- javascript
- linux
- microsoft
- missing manuals
- mobile
- music
- mysql
- ooxml
- open source
- open standards
- oscon
- perl
- physics
- politics
- privacy
- pymotw
- python
- rails
- rest webservices
- schematron
- science
- security
- standards
- velocity
- web 2.0
- xml
- xquery
- xrx
- xslt
Archives
Or, visit our complete archives.

