<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ashish Agarwal &#187; High Performance Computing</title>
	<atom:link href="http://ashishagarwal.org/tag/high-performance-computing/feed/" rel="self" type="application/rss+xml" />
	<link>http://ashishagarwal.org</link>
	<description></description>
	<lastBuildDate>Mon, 11 Mar 2019 21:28:46 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=4.1.42</generator>
	<item>
		<title>Functional Big-Data Genomics</title>
		<link>http://ashishagarwal.org/2012/09/11/functional-big-data-genomics/</link>
		<comments>http://ashishagarwal.org/2012/09/11/functional-big-data-genomics/#comments</comments>
		<pubDate>Tue, 11 Sep 2012 20:11:51 +0000</pubDate>
		<dc:creator><![CDATA[ashish]]></dc:creator>
				<category><![CDATA[Presentations]]></category>
		<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[High Performance Computing]]></category>
		<category><![CDATA[OCaml]]></category>

		<guid isPermaLink="false">http://ashishagarwal.org/?p=180</guid>
		<description><![CDATA[Abstract High-throughput genomic sequencing is characterized by large diverse datasets and numerous analysis methods. It is normal for an individual bioinformatician to work with thousands of data files and employ hundreds of distinct computations during the course of a single &#8230; <a href="http://ashishagarwal.org/2012/09/11/functional-big-data-genomics/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><strong>Abstract</strong><br />
High-throughput genomic sequencing is characterized by large diverse datasets and numerous analysis methods. It is normal for an individual bioinformatician to work with thousands of data files and employ hundreds of distinct computations during the course of a single project. This problem is magnified in &#8220;core facilities&#8221;, which support multiple researchers working on diverse projects. Most investigators use ad hoc methods to manage this complexity with dire consequences: analyses frequently fail to meet the scientific mandate of reproducibility; improved analysis methods are often not considered because recomputing all downstream steps would be overwhelming; hard drives and CPUs are used sub-optimally; and, in some cases, raw data is lost.</p>
<p>We describe HITSCORE, an OCaml software stack that operates all computational aspects of the Genomics Core Facility at New York University&#8217;s Center for Genomics and Systems Biology. HITSCORE has been in production use for one year, and was implemented quickly by less than two programmers following design advice from several biologists. A simple domain specific language (DSL) enables generating type safe database bindings and GUI components, and greatly eases updates and migration of our data model. We found a multi-lingual stack too burdensome in a small team setting, and credit OCaml for fulfilling the needs of our full application stack. It has good database bindings, excels at encoding complex domain logic, and now allows construction of rich websites due to the Ocsigen web programming framework. Higher level libraries for distributed computing would be a welcome improvement.</p>
<p>The opportunity to build this system did not stem directly from any strength of functional programming or OCaml. It was necessary for a person with credibility amongst biologists to champion its development, and this credibility took several years to build. Rapid development appears to be the single most important factor in allaying doubts about using a lesser known language, and we will briefly describe our experiences in bringing OCaml to several high profile projects.</p>
<p><a class="pdf" href='http://ashishagarwal.org/wp-content/uploads/2012/09/Functional-Big-Data-Genomics-CUFP2012.pdf'>Download slides</a><br />
<a class="television" href="http://www.youtube.com/watch?feature=plcp&#038;v=02YykaSMP0I">Video</a></p>
<p><strong>Citation</strong><br />
Ashish Agarwal, Sebastien Mondet, Paul Scheid, Aviv Madar, Richard Bonneau, Jane Carlton, Kristin C. Gunsalus. Functional Big-Data Genomics. <em><a href="http://cufp.org/conference/2012">Commercial Users of Functional Programming 2012</a></em>, Copenhagen, Denmark, Sep 15, 2012.</p>
]]></content:encoded>
			<wfw:commentRss>http://ashishagarwal.org/2012/09/11/functional-big-data-genomics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Domain Specific Language Stack for Bio HPC</title>
		<link>http://ashishagarwal.org/2012/05/11/dsl-stack-for-bio-hpc/</link>
		<comments>http://ashishagarwal.org/2012/05/11/dsl-stack-for-bio-hpc/#comments</comments>
		<pubDate>Fri, 11 May 2012 22:42:07 +0000</pubDate>
		<dc:creator><![CDATA[ashish]]></dc:creator>
				<category><![CDATA[Presentations]]></category>
		<category><![CDATA[High Performance Computing]]></category>

		<guid isPermaLink="false">http://ashishagarwal.org/?p=173</guid>
		<description><![CDATA[We have given several presentations on our DSL approach to the management and analysis of big-data in the field of Biology. Now, with Karl Ward, we have been extending this approach to the systems layer, enabling more robust management and &#8230; <a href="http://ashishagarwal.org/2012/05/11/dsl-stack-for-bio-hpc/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>We have given several presentations on our DSL approach to the management and analysis of big-data in the field of Biology. Now, with Karl Ward, we have been extending this approach to the systems layer, enabling more robust management and configuration of the hardware and software infrastructure so critical to bioinformatics. Many thanks to Efstratios (Stratos) Efstathiadis for giving us an opportunity to present this work at the first Bio HPC Workshop at NYU&#8217;s Langone Medical Center.</p>
<p>Karl Ward, Sebastien Mondet, Ashish Agarwal. A Domain Specific Language Stack for Bio HPC, <a href="http://www.nyuinformatics.org/hpcbmr/workshop1"><em>First Workshop on High Performance Computing for Biomedical Research</em></a>, Center for Health Informatics and Bioinformatics, New York University Langone Medical Center, May 2012.</p>
]]></content:encoded>
			<wfw:commentRss>http://ashishagarwal.org/2012/05/11/dsl-stack-for-bio-hpc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Shonan Meeting</title>
		<link>http://ashishagarwal.org/2012/05/09/shonan-meeting/</link>
		<comments>http://ashishagarwal.org/2012/05/09/shonan-meeting/#comments</comments>
		<pubDate>Wed, 09 May 2012 19:11:38 +0000</pubDate>
		<dc:creator><![CDATA[ashish]]></dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[High Performance Computing]]></category>

		<guid isPermaLink="false">http://ashishagarwal.org/?p=165</guid>
		<description><![CDATA[I&#8217;ll be back in Tokyo, this time for the Shonan Meeting on Bridging the Theory of Staged Programming Languages and the Practice of High-Performance Computing. My talk will be on BINQ, a domain-specific-language for genomic computations.]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.nii.ac.jp/shonan/"><img src="http://ashishagarwal.org/wp-content/uploads/2012/05/shonan-meeting-logo-300x67.png" alt="Shonan Meeting Logo" title="Shonan Meeting Logo" width="300" height="67" class="alignleft size-medium wp-image-166" /></a><br />
I&#8217;ll be back in Tokyo, this time for the Shonan Meeting on <a href="http://www.nii.ac.jp/shonan/seminar019/">Bridging the Theory of Staged Programming Languages and the Practice of High-Performance Computing</a>. My talk will be on BINQ, a domain-specific-language for genomic computations.</p>
]]></content:encoded>
			<wfw:commentRss>http://ashishagarwal.org/2012/05/09/shonan-meeting/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
