<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nathenson&#039;s digital garbage &#187; White House</title>
	<atom:link href="http://digitalgarbage.net/tag/white-house/feed/" rel="self" type="application/rss+xml" />
	<link>http://digitalgarbage.net</link>
	<description>dumpster-diving for bits about law, info, tech, and culture</description>
	<lastBuildDate>Wed, 16 Nov 2011 05:00:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>NARA hosting &#8220;lite&#8221; Bush website archive</title>
		<link>http://digitalgarbage.net/2009/02/16/nara-bush-archive/</link>
		<comments>http://digitalgarbage.net/2009/02/16/nara-bush-archive/#comments</comments>
		<pubDate>Mon, 16 Feb 2009 05:02:08 +0000</pubDate>
		<dc:creator>Ira Nathenson</dc:creator>
				<category><![CDATA[Digital Preservation]]></category>
		<category><![CDATA[Wayback Machine]]></category>
		<category><![CDATA[NARA]]></category>
		<category><![CDATA[Robots.txt]]></category>
		<category><![CDATA[White House]]></category>

		<guid isPermaLink="false">http://digitalgarbage.net/?p=1088</guid>
		<description><![CDATA[There are plenty of good changes in the new whitehouse.gov site, such as a better copyright policy that enables clearer copying and remix, and a much shorter robots.txt file, which makes it easier for search engines and archivists to index &#8230; <a href="http://digitalgarbage.net/2009/02/16/nara-bush-archive/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>There are plenty of good changes in the new <a href="http://www.whitehouse.gov/">whitehouse.gov</a> site, such as a better <a href="http://www.whitehouse.gov/copyright/">copyright policy</a> that enables clearer copying and remix, and a much shorter robots.txt file, which makes it easier for search engines and archivists to index and archive the site.  (Compare the <a href="http://www.whitehouse.gov/robots.txt">current</a> 4-line Obama robots file to a 2300+ <a href="http://georgewbush-whitehouse.archives.gov/robots.txt">version</a> from apparently late in the Bush era.)</p>
<p>But what about Bush&#8217;s old website?  Shouldn&#8217;t that be preserved?  (Well, yeah!)  But when President Obama took the oath of office, things switched over and the Bush site was gone from public view.  Did anybody keep a copy?  Well, yes, kind of.  The Internet Archive archives the whitehouse.gov site, but I have deep concerns about the completeness of its archive.  See below for a screen cap of the Internet Archive&#8217;s <a href="http://web.archive.org/web/*sa_/http://www.whitehouse.gov">database</a> of http://www.whitehouse.gov.</p>
<p><a href="http://digitalgarbage.net/wp-content/uploads/2009/02/wh-ia-2.jpg"><img class="aligncenter size-medium wp-image-1106" title="Internet Archive captures of whitehouse.gov" src="http://digitalgarbage.net/wp-content/uploads/2009/02/wh-ia-2-300x240.jpg" alt="Internet Archive captures of whitehouse.gov" width="300" height="240" /></a></p>
<p>I think it can be taken as an axiom that in a free society, it&#8217;s vital that governmental sites are archived frequently, deeply, accurately, and made available for scrutiny quickly.  But the depth of the Internet Archive&#8217;s archive of whitehouse.gov is unclear.  First, to the extent that the Bush administration&#8217;s robots.txt file told search engines and archives to stay away, did the Internet Archive fail to archive governmental content?  (Maybe not, but how can we be sure?)  Second, the Internet Archive is not up-to-date: as of this writing, the most recent public archive of whitehouse.gov is dated Mar. 25, 2008.  Finally and even more disturbingly, the Internet Archive&#8217;s frequency is poor.  It contains only 53 captures of the main whitehouse.gov page for 2007, and only 15 have yet been posted from calendar year 2008.  We can do better.</p>
<p>Interestingly, it appears that government archivists are now dipping their feet in the water.  At least part of the legacy Bush 2009 website is now being hosted by the National Archives and Records Administration (&#8220;NARA&#8221;), which administers the <a href="http://www.georgewbushlibrary.gov/">George W. Bush Presidential Library</a>.  According to the <a href="http://www.georgewbushlibrary.gov/white-house/">site</a>:</p>
<blockquote><p>To preserve the historical record of the George W. Bush administration&#8217;s presence on the web, the White House took a &#8220;snapshot&#8221; of the Whitehouse.gov web site. This is historical material, &#8220;frozen in time.&#8221; The web site is no longer updated and links to external web sites and some internal pages will not work.</p></blockquote>
<p>Having NARA archivists maintain an archive is a good start.  (Though there should always be archives maintained by disinterested third parties as well.)  But it&#8217;s not enough to have a &#8220;snapshot&#8221; of a presidential website.  Not only does the archive lack temporal depth (it&#8217;s only from materials existing in January 2009), but it appears to be incomplete as well, as even some internal links are admitted not to function.  Plus, as the site indicates, the &#8220;White House&#8221; took the snapshot.  I take this to mean that it was taken by interested White House insiders rather than by (hopefully) disinterested professional archivists at NARA.</p>
<p>H/T on Bush Archive to <a href="http://twitter.com/BushLegacy/status/1213166967">BushLegacy</a> via Twitter.</p>
]]></content:encoded>
			<wfw:commentRss>http://digitalgarbage.net/2009/02/16/nara-bush-archive/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A presidential &#8220;legacy&#8221; via rewritten history</title>
		<link>http://digitalgarbage.net/2008/11/25/presidential-legacy/</link>
		<comments>http://digitalgarbage.net/2008/11/25/presidential-legacy/#comments</comments>
		<pubDate>Tue, 25 Nov 2008 21:01:35 +0000</pubDate>
		<dc:creator>Ira Nathenson</dc:creator>
				<category><![CDATA[Digital Preservation]]></category>
		<category><![CDATA[Internet Archive]]></category>
		<category><![CDATA[White House]]></category>

		<guid isPermaLink="false">http://digitalgarbage.net/?p=543</guid>
		<description><![CDATA[Web archiving is a topic of great interest to me and the subject of an article I&#8217;m writing.  Part of the paper addresses the Bush administration&#8217;s questionable conduct regarding the content of the White house website.  For example, the White &#8230; <a href="http://digitalgarbage.net/2008/11/25/presidential-legacy/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Web archiving is a topic of great interest to me and the subject of an article I&#8217;m writing.  Part of the paper addresses the Bush administration&#8217;s questionable conduct regarding the content of the White house website.  For example, the White House website&#8217;s <a href="http://www.whitehouse.gov/robots.txt">robots exclusion file</a> &#8212; a <a href="http://www.robotstxt.org/">mechanism</a> that can be used to ask search engine and web archive spiders to stay away &#8212; is nearly 2300 lines long.  2300 lines?  Simply absurd.  (Click <a href="http://digitalgarbage.net/wp-content/uploads/2008/11/wh-robots-2008-11-25.txt">here</a> for a copy of the White House robots file that I downloaded on Nov. 25, 2008.)</p>
<p>Today, researchers at the University of Illinois released a <a href="http://www.clinecenter.uiuc.edu/airbrushing_history/">study</a> showing how the White House has deleted or modified portions of its website.  Their findings are, sadly, unsurprising:</p>
<blockquote>
<p align="justify">Legacies are in the air as President Bush prepares to leave the  White House. How future historians will judge the president remains to be seen,  but one thing is certain: future historians won&#8217;t have all the facts needed to  make that judgment. One legacy at risk of being forgotten is the way the Bush  White House has quietly deleted or modified key documents in the public record  that are maintained under its direct control.</p>
<p align="justify">Remember the &#8220;Coalition of the Willing&#8221; that sided with the  United States during the 2003 invasion of Iraq? If you search the White House  web site today you&#8217;ll find a <a href="http://www.whitehouse.gov/infocus/iraq/news/20030327-10.html">press  release</a> dated March 27, 2003 listing 49 countries forming the coalition. A  key piece of evidence in the historical record, but also a troubling one. It is  an impostor.</p>
<p align="justify">And although there were only 45 coalition members on the eve of the Iraq  invasion, later deletions and revisions to key documents make it seem that there  were always 49.</p>
</blockquote>
<p>The study is a disturbing read.  Rightly or not, a primary source of history for many researchers is the web.  And any effort by the government to modify or delete historical records is appalling.  As the authors note:</p>
<blockquote><p>Updating lists to keep up with the times is one thing. Deleting original  documents from the White House archives is another. Back-dating later documents  and using them to replace the originals goes beyond irresponsible stewardship of  the public record. It is rewriting history.</p></blockquote>
<p style="text-align: left;">H/T: <a href="http://www.nytimes.com/2008/11/25/washington/25documents.html">New York Times</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://digitalgarbage.net/2008/11/25/presidential-legacy/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GAO report and pending bill on federal e-mail retention</title>
		<link>http://digitalgarbage.net/2008/07/09/federal-agency-e-mail-retention/</link>
		<comments>http://digitalgarbage.net/2008/07/09/federal-agency-e-mail-retention/#comments</comments>
		<pubDate>Wed, 09 Jul 2008 15:39:17 +0000</pubDate>
		<dc:creator>Ira Nathenson</dc:creator>
				<category><![CDATA[Data Retention]]></category>
		<category><![CDATA[Digital Preservation]]></category>
		<category><![CDATA[E-mail]]></category>
		<category><![CDATA[Legislation]]></category>
		<category><![CDATA[National Archives]]></category>
		<category><![CDATA[White House]]></category>

		<guid isPermaLink="false">http://digitalgarbage.net/?p=123</guid>
		<description><![CDATA[The GAO has released a report entitled Federal Records: National Archives and Selected Agencies Need to Strengthen E-Mail Management. The report found that &#8220;[w]ithout periodic evaluations of recordkeeping practices or other controls to ensure that staff are trained and carry &#8230; <a href="http://digitalgarbage.net/2008/07/09/federal-agency-e-mail-retention/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The GAO has released a <a href="http://www.gao.gov/cgi-bin/getrpt?GAO-08-742">report</a> entitled <em>Federal Records: National Archives and Selected Agencies Need to Strengthen E-Mail Management. </em>The report found that &#8220;[w]ithout periodic evaluations of recordkeeping practices or other controls to ensure that staff are trained and carry out their responsibilities, agencies have little assurance that e-mail records are properly identified, stored, and preserved.&#8221;  It also stated:</p>
<blockquote><p>Although NARA [i.e., the National Archives and Records Administration] has responsibilities for oversight of agencies’ records and records management programs and practices, including conducting inspections or surveys, performing studies, and reporting results to the Congress and the Office of Management and Budget (OMB), in recent years NARA’s oversight activities have been primarily limited to performing studies. NARA has conducted no inspections of agency records management programs since 2000, because it uses inspections only to address cases of the highest risk, and no recent cases have met its criteria. In addition, NARA has not consistently reported details on records management problems or recommended practices that were discovered as a result of its studies. Without more comprehensive evaluations of agency records management, NARA has limited assurance that agencies are appropriately managing the records in their custody and that important records are not lost.</p></blockquote>
<p>Meanwhile, the White House is <a href="http://www.whitehouse.gov/omb/legislative/sap/110-2/saphr5811-r.pdf">threatening</a> a veto of the <a href="http://www.thomas.gov/cgi-bin/bdquery/z?d110:h.r.05811:">Electronic Message Preservation Act</a>.  According to the <a href="http://historycoalition.org/2008/06/13/history-coalition-endorses-electronic-message-preservation-act/">National Coalition for History</a>, the bill &#8220;would direct the National Archives and Records Administration (NARA) to establish standards for the capture, management, preservation and retrieval of federal agency and presidential electronic messages that are records in an electronic format.&#8221;  (Further info on the bill <a href="http://chronicle.com/news/article/4675/advocacy-group-for-history-endorses-electronic-records-bill?utm_source=at&amp;utm_medium=en">here</a>, and on issues concerning the White House&#8217;s e-mail retention practices <a href="http://www.washingtonpost.com/wp-dyn/content/article/2008/01/21/AR2008012102070.html">here</a>.)</p>
<p>Hat tip regarding the GAO report to <a href="http://www.davidmattison.ca/wordpress/?p=2358">David Mattison</a> at the <a href="http://www.davidmattison.ca/wordpress/">Ten-Thousand Year Blog</a>.  Hat tip regarding the bill to BNA&#8217;s Privacy Law Watch.</p>
]]></content:encoded>
			<wfw:commentRss>http://digitalgarbage.net/2008/07/09/federal-agency-e-mail-retention/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

