<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Colabrativ, Inc. &#187; database</title>
	<atom:link href="http://www.colabrativ.com/tag/database/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.colabrativ.com</link>
	<description>An Experiment Documentation and Electronic Notebook Provider</description>
	<lastBuildDate>Tue, 28 Oct 2014 04:44:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Todd Park&#8217;s Talk on Unleashing the Power of Open Data and Innovation for Health Care</title>
		<link>http://www.colabrativ.com/todd-park-talk-on-unleashing-the-power-of-open-data-and-innovation-for-health-care/</link>
		<comments>http://www.colabrativ.com/todd-park-talk-on-unleashing-the-power-of-open-data-and-innovation-for-health-care/#comments</comments>
		<pubDate>Thu, 27 Sep 2012 15:17:49 +0000</pubDate>
		<dc:creator>Marc Whitlow</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Healthcare]]></category>
		<category><![CDATA[innovation]]></category>
		<category><![CDATA[Open Data]]></category>

		<guid isPermaLink="false">http://www.colabrativ.com/?p=575</guid>
		<description><![CDATA[I had the chance to hear the rebroadcast of a talk by Todd Park, the U.S. Chief Technology Officer, given on June 18, 2012 at The Commonwealth Club of California, on Unleashing the Power of Open Data and Innovation for &#8230; <a href="http://www.colabrativ.com/todd-park-talk-on-unleashing-the-power-of-open-data-and-innovation-for-health-care/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>I had the chance to hear the rebroadcast of a talk by Todd Park, the U.S. Chief Technology Officer, given on June 18, 2012 at The Commonwealth Club of California, on <a target="_Blank" href="http://www.commonwealthclub.org/events/2012-06-18/just-added-todd-park-us-chief-technology-officer">Unleashing the Power of Open Data and Innovation for Health Care.</a>  He is a leading proponent for open health data in the U.S. health system.  His team at Health and Human Services in collaboration with the <a target="_Blank" href="http://www.iom.edu/Activities/PublicHealth/HealthData.aspx">Institute of Medicine</a> launched the <a target="_Blank" href="http://healthdata.gov/">Health Data Initiative</a> in 2009.  This initiative follows in the footsteps of two other successful U.S. government open data projects.  40 year ago National Oceanic and Atmospheric Administration (NOAA) made weather data available for free public download.  In the 1980s, the U.S. government began making the Global Positioning System (GPS) available to the public.  </p>
<h3>Health Care Cost Reduction through Open Data</h3>
<p>The goal for this Health Data Initiative was to spur health care innovation that will drive down the cost of healthcare by allowing everyone to have access to health care information while <strong>protecting privacy and maintaining confidentiality of the information</strong>.  Mr. Park presented an example of how quickly such a scenario could come about.  In February of 2011, Georgetown hosted a hack-a-ton.  IN 8 hours, a group from Mia in Pittsburgh having no healthcare background, but with expertise in supply chain management built a working prototype of Food Oasis, an app to address the food desert problem.  If you live in a food desert, you do not have access to healthy food.  You can text a message to Food Oasis indicating the food you would like to purchase.  The message gets sent to a web site where farmers&#8217; markets and food coops can view the orders.  These suppliers then aggregate the data and find the orders they can fulfill.  They then text back to consumers when and where their orders can be picked up.  Due to the low overhead, this turns out to be a cost effective way of resolving this problem.  </p>
<h3>You Have the Right to Your Medical Records</h3>
<p><strong>You absolutely have the rights to your medical records.</strong>  In fact the head of the Department of Health and Human Services, check withing Todd for the name, has published an open letter stating that you have the rights to your own medical records. </p>
<h3>Information at HealthData.gov</h3>
<p>At <a target="_Blank" href="http://healthdata.gov/">healthdata.gov</a> you can find information on: </p>
<table style="border:0px solid white;">
<tr>
<td style="border:0px solid white; width:150px;">Administrative</td>
<td style="border:0px solid white;">Data on administering health care delivery, enrollment into health insurance plans and appeals.</td>
</tr>
<tr>
<td style="border:0px solid white;">Biomedical Research</td>
<td style="border:0px solid white;">Authoritative, up-to-date medical and scientific information resources for patients, families, health care providers and researchers.</td>
</tr>
<tr>
<td style="border:0px solid white;">Children&#8217;s Health</td>
<td style="border:0px solid white;">Information on children’s health and health-related services for researchers, policymakers, patients and families.</td>
</tr>
<tr>
<td style="border:0px solid white;">Epidemiology</td>
<td style="border:0px solid white;">Public health databases and registries regarding births, deaths, disease incidence, health event case reports, demographics, community health.</td>
</tr>
<tr>
<td style="border:0px solid white;">Health Care Cost</td>
<td style="border:0px solid white;">Includes National Health Expenditure Accounts (NHEA), the official estimates of total health care spending in the United States.</td>
</tr>
<tr>
<td style="border:0px solid white;">Health Care Providers</td>
<td style="border:0px solid white;">Freedom of Information Act disclosable health care provider data for providers.</td>
</tr>
<tr>
<td style="border:0px solid white;">Medicaid</td>
<td style="border:0px solid white;">General information on eligibility and claims data developed to support research and policy analysis initiatives for Medicaid recipients and other low-income populations.</td>
</tr>
<tr>
<td style="border:0px solid white;">Medicare</td>
<td style="border:0px solid white;">Cost report data from annual reports filed by hospitals, home health agencies, and other facilities; claim-level public use files for all major types of care.</td>
</tr>
<tr>
<td style="border:0px solid white;">Population Statistics</td>
<td style="border:0px solid white;">Metrics on community health, health care system, and determinants-of-health performance at national, state or county levels.</td>
</tr>
<tr>
<td style="border:0px solid white;">Quality Measurement</td>
<td style="border:0px solid white;">Quality and patient satisfaction data available via Application Programming Interfaces (APIs) for nursing homes, hospitals, home health agencies, and dialysis centers.</td>
</tr>
<tr>
<td style="border:0px solid white;">Safety</td>
<td style="border:0px solid white;">Includes all company-issued recalls for drugs, food, products from 2009 to the present; hazardous substances and environmental and public health maps.</td>
</tr>
<tr>
<td style="border:0px solid white;">Treatments</td>
<td style="border:0px solid white;">Information and databases about marketed drugs, including downloadable resources on medication content and labeling, text messaging libraries and product listing directories.</td>
</tr>
</table>
<p>I would encourage you to listen to the recording of Mr. Park&#8217;s talk on <a target="_Blank" href="http://www.commonwealthclub.org/events/2012-06-18/just-added-todd-park-us-chief-technology-officer">The Commonwealth Club of California</a> web site.  He covered many more health case related and open data topics during the hour he talked and answered audience questions. </p>
]]></content:encoded>
			<wfw:commentRss>http://www.colabrativ.com/todd-park-talk-on-unleashing-the-power-of-open-data-and-innovation-for-health-care/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Creation of Test Patients Table in Microsoft Excel</title>
		<link>http://www.colabrativ.com/creation-of-test-patients-table-in-microsoft-excel/</link>
		<comments>http://www.colabrativ.com/creation-of-test-patients-table-in-microsoft-excel/#comments</comments>
		<pubDate>Tue, 18 Oct 2011 18:28:58 +0000</pubDate>
		<dc:creator>Marc Whitlow</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Excel]]></category>
		<category><![CDATA[patients]]></category>

		<guid isPermaLink="false">http://www.colabrativ.com/?p=260</guid>
		<description><![CDATA[We are helping Deb Zajchowski at The Clearity Foundation on their patient database. The Clearity Foundation is a non-profit organization dedicated to &#34;improving treatment options for ovarian cancer patients.&#34; To improve treatment they take a personalized medicine approach. They have &#8230; <a href="http://www.colabrativ.com/creation-of-test-patients-table-in-microsoft-excel/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>We are helping Deb Zajchowski at <a target="_blank" href="http://www.clearityfoundation.org/default.aspx">The Clearity Foundation</a> on their patient database. The Clearity Foundation is a non-profit organization dedicated to &quot;improving treatment options for ovarian cancer patients.&quot; To improve treatment they take a personalized medicine approach.  They have a privacy-ensured database in which they collect information on the patient’s clinical history, including physician’s diagnosis, diagnostic procedures, treatments and the results from tumor molecular profiling analyses.  The database also records the drugs that are likely to have clinical benefit based on the profile of the patient’s cancer. </p>
<p>Our goal is to enhance the Clearity database ability to track the patient outcomes in a retrospective analysis.  The current database was developed by Michael L. Petka.  It works well for the profiling and reporting needs of the organization.  However, Mike is currently occupied with other aspects of database enhancement. </p>
<p>In order to ensure the privacy of the patient, we needed a test database that did not contain any actual patient information. In this blog I will describe how we created a test Patients table in Excel. </p>
<p>The Patients table was created by Michael L. Petka, and is made up of the following columns:</p>
<table>
<thead>
<caption><strong>Patients Table Columns</strong></caption>
<tr>
<th>Column Name</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>PatientID</td>
<td>Patient Identification Key</td>
</tr>
<tr>
<td>PatientLastName</td>
<td>Last Name</td>
</tr>
<tr>
<td>PatientFirstName</td>
<td>First Name</td>
</tr>
<tr>
<td>PatientMiddleInitial</td>
<td>Middle Initial</td>
</tr>
<tr>
<td>PatAddress</td>
<td>Street Address</td>
</tr>
<tr>
<td>PatCity</td>
<td>City</td>
</tr>
<tr>
<td>PatState</td>
<td>State</td>
</tr>
<tr>
<td>PatZip</td>
<td>5-digit ZIP Code</td>
</tr>
<tr>
<td>SSN</td>
<td>Social Security Number</td>
</tr>
<tr>
<td>DOB</td>
<td>Date of Birth</td>
</tr>
<tr>
<td>Sex</td>
<td>Sex</td>
</tr>
<tr>
<td>PatientTelephoneNumber</td>
<td>Telephone Number</td>
</tr>
<tr>
<td>PatientEmail</td>
<td>Patient Email</td>
</tr>
</tbody>
</table>
<h3>Patient Identification Key</h3>
<p>The patient identification key (PatientID) is a sequential numbering of the patient rows.                </p>
<h3>Patient Name</h3>
<p>We were fortunate to find a table of 10,000 random names at <a target="_blank" href="http://www.opensourcecf.com/1/2009/05/10000-Random-Names-Database.cfm">The ColdFusion Open Source Software Blog.</a>  In addition to the columns we needed, this database has complete name strings, both first name first, and last name first. The first name, last name and middle initial columns were cut and pasted into our test Patients table in Excel. </p>
<h3>Patient Sex</h3>
<p>The sex of the random names was determined in Excel by comparing the first name of a random name with a list of girls&#8217; names from <a target="_blank" href="http://www.randomnames.com/allnames.asp?g=f">RandomNames.com</a>.  If the name matched, F (female) was assigned, otherwise M (male) was assigned, using the function below.</p>
<div class="codeBox">
<p><strong>Sex Determination Based on Name</strong></p>
<pre>=IF(EXACT(A5,LOOKUP(A5,I$2:I$895)),"F","M")</pre>
<p>where column A contains the random first name (above element A5 is being examined), and array I2 to I895 contains the girls&#8217; first names.</p>
</div>
<h3>Street Address</h3>
<p>We found 261 of the most popular neighborhood street names at <a target="_blank" href="http://www.livingplaces.com/streets/most-popular_street_names.html">Living Places</a>.  We generated random house numbers and random street names, then concatenated them together in Excel.</p>
<div class="codeBox">
<p><strong>Random House Number</strong></p>
<pre>=INT(11000*RAND())</pre>
<p><strong>Random Street Name</strong></p>
<pre>=INDIRECT("StreetNames!B"&amp;RANDBETWEEN(2,262))</pre>
<p>where array B2 to B262 in the StreetNames worksheet contains the most-popular neighborhood street names. </p>
<p><strong>Street Address Concatenation</strong></p>
<pre>=CONCATENATE(A5," ", B5)</pre>
<p>In the example above we are working on row 5, where column A contains the random house number and column B contains the random street name.</p>
</div>
<h3>City, State and ZIP code</h3>
<p>A table containing city, state and ZIP codes of 80,810 places was downloaded from <a target="_blank" href="http://federalgovernmentzipcodes.us/">A Free Zip Code Database</a> as the Excel file <a href="http://federalgovernmentzipcodes.us/free-zipcode-database.xlsx">free-zipcode-database.xlsx</a>.  This file has more than just city, state and ZIP codes.  It also includes type (STANDARD, PO BOX ONLY, MILITARY, etc.), county name, latitude, longitude, population, land area and water area.  In Excel, the table was sorted on type, for we only wanted to use types STANDARD and PO BOX ONLY.  After removing everything else, the list was reduced from 80,180 to 73,756 entries.  We then generated 10,000 random ZIP keys to match up with our 10,000 random names in Excel using RANDBETWEEN(2,73757).  All of this was stored in the City worksheet in Excel.  In the Patients table (worksheet), the city, state and ZIP codes were added using arandom ZIP code key.  The city name was capitalized using the PROPER function; see below. </p>
<div class="codeBox">
<p><strong>City, State and ZIP Code</strong></p>
<pre>=PROPER(INDIRECT("City!D"&amp;City!M5))

=INDIRECT("City!E"&amp;City!M5))

=INDIRECT("City!A"&amp;City!M5))</pre>
<p>Where &#8220;City!D&#8221;, &#8220;City!E&#8221; and &#8220;City!A&#8221; are the city, state and ZIP code columns, respectively, in the City worksheet. </p>
</div>
<h3>Social Security Number</h3>
<p>A random social security number (SSN) was generated for each fictitious patient by concatenating three RANDBETWEEN operations together; see below.</p>
<div class="codeBox">
<p><strong>Social Security Number</strong></p>
<pre>=CONCATENATE(RANDBETWEEN(100,999),"-",RANDBETWEEN(10,99),"-",MID((RANDBETWEEN(10000,19999)),2,4))</pre>
</div>
<h3>Date of Birth</h3>
<p>We wanted a normal distribution of birthdays, so we randomly seeded the NORMINV function with random value between 0 and 1.  We used a mean of 21,000 (June 29, 1957) and a standard deviation of 5,000 days (13.7 years) in the NORMINV function to get an acceptable distribution; see below.</p>
<div class="codeBox">
<p><strong>Normally Distributed Date of Birth</strong></p>
<pre>=NORMINV(RAND(),21000,5000)</pre>
</div>
<h3>Telephone Number</h3>
<p>Random telephone numbers were generated in the same manner as the social security numbers, by concatenation of three RANDBETWEEN operations; see below.</p>
<div class="codeBox">
<p><strong>Telephone Number</strong></p>
<pre>=CONCATENATE(INT(RANDBETWEEN(100,999)),"-",INT(RANDBETWEEN(100,999)),"-",MID((RANDBETWEEN(10000,19999)),2,4))</pre>
</div>
<h3>Email Address</h3>
<p>Fictitious patient email addresses were constructed from each fictitious patient&#8217;s first name, last name and the domain name &#8220;@example.com&#8221;; see below.</p>
<div class="codeBox">
<p><strong>Email Address</strong></p>
<pre>=CONCATENATE(C5,".",B5,"@example.com")</pre>
<p>where columns B and C contain the last and first names, respectively.  In the example above we are working on row 5.</p>
</div>
<h3>Stability of the Excel Random Cells</h3>
<p>We found that the randomly created cells in Excel were not stable.  If such a cell were edited, previous random values would be regenerated, effecting all the cell that depended on that random number.  Therefore, the final step was to write out the Patients worksheet as a comma-separated values (CSV) file, and then to read the CSV back into Excel.  In this way we were able to create a stable test Patients table in Excel.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.colabrativ.com/creation-of-test-patients-table-in-microsoft-excel/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
