<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Quantum Forest &#187; statistics</title>
	<atom:link href="http://quantum.uncronopio.org/category/statistics/feed/" rel="self" type="application/rss+xml" />
	<link>http://quantum.uncronopio.org</link>
	<description>logs written by Luis Apiolaza in Christchurch, New Zealand</description>
	<lastBuildDate>Sun, 10 Jan 2010 21:24:21 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Default to R</title>
		<link>http://quantum.uncronopio.org/2009/09/08/default-to-r/</link>
		<comments>http://quantum.uncronopio.org/2009/09/08/default-to-r/#comments</comments>
		<pubDate>Mon, 07 Sep 2009 23:37:07 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[software]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/?p=385</guid>
		<description><![CDATA[Last March§ I posted an explanation of the issues behind getting R accepted in our School for teaching statistics.
At School level, I needed to spend substantial time compiling information to prove that R could satisfy my colleagues&#8217; statistical needs. Good selling points were lme4, lattice/ggplot and pointing my most statistically inclined colleagues to CRAN. Another [...]]]></description>
			<content:encoded><![CDATA[<p>Last March<sup><a href="http://quantum.uncronopio.org/2009/03/12/teaching-stats-and-software/">§</a></sup> I posted an explanation of the issues behind getting R accepted in our School for teaching statistics.</p>
<p>At School level, I needed to spend substantial time compiling information to prove that R could satisfy my colleagues&#8217; statistical needs. Good selling points were lme4, lattice/ggplot and pointing my most statistically inclined colleagues to CRAN. Another important issue was the ability to have a GUI (Rcmdr) that could be adapted to our specific needs. We will develop an extra menu item to fit non-linear models for growth models used in forestry. Our School has now adopted R as the default software for teaching any statistical content during the four years of the curriculum.</p>
<p>At the university level, my questions to the department of Mathematics and Statistics sparkled a lot of internal discussion, which resulted in R being adopted as the standard software for some of the second year courses (it was already the standard for most courses in 3<sup>rd</sup> and 4<sup>th</sup> year). The decision was not unanimous, particularly because for statisticians knowing SAS is one of those &#8216;must be in the CV&#8217; skills, but they went for change. The second year courses are offered across colleges, which makes the change very far reaching. These changes will also imply that in the near future many computers in the university will come with R preinstalled. </p>
<p>It is nice to see interesting changes once in a while.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2009/09/08/default-to-r/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Using R for genetic analyses</title>
		<link>http://quantum.uncronopio.org/2009/06/11/using-r-for-genetic-analyses/</link>
		<comments>http://quantum.uncronopio.org/2009/06/11/using-r-for-genetic-analyses/#comments</comments>
		<pubDate>Wed, 10 Jun 2009 22:39:20 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[genetics]]></category>
		<category><![CDATA[mac]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/?p=374</guid>
		<description><![CDATA[As some people know, I have been using asreml for genetic analyses for quite a few years and even keep the ASReml Cookbook§. I was quite happy to see the development of asreml-R, a package that makes available most of ASReml&#8217;s functionality from R. This made my life easier: I still use plain-vanilla ASReml for [...]]]></description>
			<content:encoded><![CDATA[<p>As some people know, I have been using asreml for genetic analyses for quite a few years and even keep the ASReml Cookbook<sup><a href="http://uncronopio.org/ASReml">§</a></sup>. I was quite happy to see the development of asreml-R, a package that makes available most of ASReml&#8217;s functionality from R. This made my life easier: I still use plain-vanilla ASReml for very big jobs, but I can access a much more comprehensive statistical system for fairly substantial jobs.</p>
<p>One of my main problems with asreml-R is that is not available for OSX (mac). Yes, I can dualboot or use a virtual box, but both options are a bit of a pain. I rather use my computer with its primary operating system and no strange overheads. I have requested several times to have a mac version. It seems that the code can be compiled without problems, but it is the license management software that is not available for the mac.</p>
<p>I then started looking for options to run genetic analyses. <em>nlme</em> was designed around hierarchical models and fitting experimental designs did not feel right. <em>lme4</em> is looking good and the main issue was around fitting pedigrees, a matter at least partially solved by the <em>pedigreemm</em> package. I then came across the <em>MCMCglmm</em><sup><a href="http://cran.r-project.org/web/packages/MCMCglmm/index.html">§</a></sup> package, which has some nice features: it makes Bayesian analyses accessible, ready support for pedigrees and a syntax not that different from asreml-R.</p>
<p>After playing with the MCMCglmm library, I found that I could not use pedigrees with parents acting both as males and females. I modified the code (line 26 of inverseA.R) to print a warning rather than to stop and the compiled the library again. Voila! it is working (the beauty of having access to the source).</p>
<p><code><br />
R CMD INSTALL /Users/lap44/Downloads/MCMCglmm --library=/Users/lap44/Library/R/2.9/library<br />
</code></p>
<p>By the way, ASReml is still my primary tool at the moment, but I enjoy having good alternatives.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2009/06/11/using-r-for-genetic-analyses/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Teaching stats and software</title>
		<link>http://quantum.uncronopio.org/2009/03/12/teaching-stats-and-software/</link>
		<comments>http://quantum.uncronopio.org/2009/03/12/teaching-stats-and-software/#comments</comments>
		<pubDate>Thu, 12 Mar 2009 08:36:58 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[software]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/?p=365</guid>
		<description><![CDATA[Forestry deals with variability and variability is the province of statistics. The use of statistics permeates forestry: we use sampling for inventory purposes, we use all sort of complex linear and non-linear regression models to predict growth, linear mixed models are the bread and butter of the analysis of experiments, etc. 
I think it is [...]]]></description>
			<content:encoded><![CDATA[<p>Forestry deals with variability and variability is the province of statistics. The use of statistics permeates forestry: we use sampling for inventory purposes, we use all sort of complex linear and non-linear regression models to predict growth, linear mixed models are the bread and butter of the analysis of experiments, etc. </p>
<p>I think it is fair to expect foresters to be at least acquainted with basic statistical tools, and we have two courses covering ANOVA and regression. In addition, we are supposed to introduce/reinforce statistical concepts in several other courses. So far so good, until we reach the issue of software.</p>
<p>During the first year of study, it is common to use MS Excel. I am not a big fan of Excel, but I can tolerate its use: people do not require much training to (ab)use it and it has a role to introduce students to some of the &#8217;serious/useful&#8217; functions of a computer; that is, beyond gaming. However, one can hit Excel limits fairly quickly which&#8211;together with the lack of audit trail for the analyses and the need to repeat all the pointing and clicking every time we need an analysis&#8211;makes looking for more robust tools very important.</p>
<p>Our current robust tool is SAS (mostly BASE and STAT, with some sprinkles of GRAPH), which is introduced in second year during the ANOVA and regression courses. SAS is a fine product, however:</p>
<ul>
<li>We spend a very long time explaining how to write simple SAS scripts. Students forget the syntax very quickly.</li>
<li>SAS&#8217;s graphical capabilities are fairly ordinary and not at all conducive to exploratory data analysis.</li>
<li>SAS is extremely expensive, and it is dubious that we could afford to add the point and click module.</li>
<li>SAS tends to define the subject; I mean, it adopts new techniques very slowly, so there is the tendency to do only what SAS can do. This is unimportant for undergrads, but it is relevant for postgrads.</li>
<li>Users tend to store data in SAS&#8217;s own format, which introduces another source of lock-in.</li>
</ul>
<p>In my research work I use mostly ASReml<sup><a href="http://uncronopio.org/ASReml/HomePage">§</a></sup> (for specialized genetic analyses) and R<sup><a href="http://www.r-project.org/">§</a></sup> (for general work), although I am moving towards using ASReml-R (an R library that interfaces ASReml) to have a consistent work environment. For teaching I use SAS to be consistent with second year material.</p>
<p>Considering the previously mentioned barriers for students I have started playing with R-commander<sup><a href="http://socserv.mcmaster.ca/jfox/Misc/Rcmdr/">§</a></sup>, a cross-platform GUI for R created by John Fox (the writer of some very nice statistics books<sup><a href="http://socserv.mcmaster.ca/jfox/Books/Applied-Regression-2E/index.html">§</a></sup>, by the way). As I see it:</p>
<ul>
<li>Its use in command mode is not more difficult than SAS.</li>
<li>We can get R-commander to start working right away with simple(r) methods, while maintaining the possibility of moving to more complex methods later by typing commands or programming.</li>
<li>It is free, so our students can load it into their laptops and keep on using it when they are gone. This is particularly true with international students: many of them will never see SAS again in their home countries.</li>
<li>It allows an easy path to data exploration (pre-requisite for building decent models) and high quality graphs.</li>
<li>R is open source and easily extensible.</li>
</ul>
<p>I think that R would be an excellent fit for teaching; nevertheless, there would be a few drawbacks, mostly when dealing with postgrads:</p>
<ul>
<li>There are restrictions to the size of datasets (they have to fit in memory), although there are ways to deal with some of the restrictions. On the other hand, I have hit the limits of PROC GLM and PROC MIXED before and that is where ASReml shines.</li>
<li>Some people have an investment in SAS and may not like the idea of using a different software.</li>
</ul>
<p>We will see how it goes because&#8211;as someone put it many years ago&#8211;there is always resistance to change:</p>
<blockquote><p>It must be remembered that there is nothing more difficult to plan, more doubtful of success, nor more dangerous to manage, than the creation of a new system. For the initiator has the enmity of all who would profit by the preservation of the old institutions and merely lukewarm defenders in those who would gain by the new ones.—<em>Niccolò Machiavelli, The Prince, Chapter 6.</em><sup><a href="http://www.liberliber.it/biblioteca/m/machiavelli/il_principe/html/princi_a.htm#capitolo6">§</a></sup></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2009/03/12/teaching-stats-and-software/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Multivariate simulation for a breeding program</title>
		<link>http://quantum.uncronopio.org/2009/01/13/multivariate-simulation-for-a-breeding-program/</link>
		<comments>http://quantum.uncronopio.org/2009/01/13/multivariate-simulation-for-a-breeding-program/#comments</comments>
		<pubDate>Mon, 12 Jan 2009 23:01:08 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[research]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/?p=355</guid>
		<description><![CDATA[The core of multivariate simulation for a breeding program is the generation of observations that follow a given covariance matrix V. Using Cholesky decomposition (so V = C`C) one can easily generate the desired distribution. I use the R core.sim function as the basic building block for creating base populations, and progeny tests.

# core.sim generates [...]]]></description>
			<content:encoded><![CDATA[<p>The core of multivariate simulation for a breeding program is the generation of observations that follow a given covariance matrix <strong>V</strong>. Using Cholesky decomposition (so <strong>V</strong> = <strong>C`C</strong>) one can easily generate the desired distribution. I use the R <code>core.sim</code> function as the basic building block for creating base populations, and progeny tests.</p>

<div class="wp_syntax"><div class="code"><pre class="r r" style="font-family:monospace;"><span style="color: #8ea1b0; font-style: italic;"># core.sim generates n.obs observations, which follow a</span>
<span style="color: #8ea1b0; font-style: italic;"># n.vars multivariate normal distribution with mean 0</span>
<span style="color: #8ea1b0; font-style: italic;"># and variance C`C. That is, it takes the Cholesky</span>
<span style="color: #8ea1b0; font-style: italic;"># decomposition C of a covariance matrix as argument.</span>
<span style="color: #8ea1b0; font-style: italic;"># This function is used by all base population and progeny </span>
<span style="color: #8ea1b0; font-style: italic;"># testing functions.</span>
core<span style="color: #000000;">.sim</span> &lt;- <span style="color: #006600; font-weight: bold;">function</span><span style="color: #ff0000;">&#40;</span><span style="color: #000000;">C</span>, n<span style="color: #000000;">.obs</span>, n.vars<span style="color: #ff0000;">&#41;</span><span style="color: #ff0000;">&#123;</span>
	<span style="color: #000000;">N</span> &lt;- <span style="color: #009900; font-weight: bold;">matrix</span><span style="color: #ff0000;">&#40;</span><span style="color: #000000;">data</span> = <span style="color: #009900; font-weight: bold;">rnorm</span><span style="color: #ff0000;">&#40;</span>n.obs*n.vars<span style="color: #ff0000;">&#41;</span>, 
	            <span style="color: #000000;">nrow</span> = n<span style="color: #000000;">.vars</span>, <span style="color: #000000;">ncol</span> = n.obs<span style="color: #ff0000;">&#41;</span>
	<span style="color: #000000;">S</span> &lt;- <span style="color: #009900; font-weight: bold; font-style: italic;">t</span><span style="color: #ff0000;">&#40;</span><span style="color: #009900; font-weight: bold;">C</span> %*% N<span style="color: #ff0000;">&#41;</span>
	<span style="color: #009900; font-weight: bold;">return</span><span style="color: #ff0000;">&#40;</span>S<span style="color: #ff0000;">&#41;</span>
	<span style="color: #ff0000;">&#125;</span></pre></div></div>

<p>R syntax highlighting courtesy of the WP-syntax plugin (an interface to <a href="http://qbnz.com/highlighter/">GEshi</a>). </p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2009/01/13/multivariate-simulation-for-a-breeding-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My favorite statistics books</title>
		<link>http://quantum.uncronopio.org/2008/11/02/my-favourite-statistics-books/</link>
		<comments>http://quantum.uncronopio.org/2008/11/02/my-favourite-statistics-books/#comments</comments>
		<pubDate>Sun, 02 Nov 2008 07:46:26 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[books]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/?p=340</guid>
		<description><![CDATA[A large proportion of my work involves using statistics, mostly analyzing progeny trials or other smaller experimental designs. On terms of statistical techniques, this means generalized linear mixed models (GLMM) with at least one of the random effects with non-independent observations (a pedigree). Variations of the topic include multivariate analyses, longitudinal analyses, spatial analyses. Most [...]]]></description>
			<content:encoded><![CDATA[<p>A large proportion of my work involves using statistics, mostly analyzing progeny trials or other smaller experimental designs. On terms of statistical techniques, this means generalized linear mixed models (GLMM) with at least one of the random effects with non-independent observations (a pedigree). Variations of the topic include multivariate analyses, longitudinal analyses, spatial analyses. Most of the time the traits are normally distributed, but some times I end up with binary or count traits.</p>
<p>I have checked quite a few books and have some favorites:</p>
<ul>
<li><a href="http://www.amazon.com/gp/product/0534159001?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0534159001">Regression with Graphics: A Second Course in Applied Statistics</a> by Lawrence Hamilton. I have a soft spot for this &#8216;newbie&#8217; book, which I think presents ideas in a very easy to follow way, always emphasizing understanding the data by using graphics. It is a good beginner&#8217;s book.</li>
<li><a href="http://www.amazon.com/gp/product/0387952322?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0387952322">Regression Modeling Strategies</a> by Frank Harrell. This is currently my favorite &#8216;advanced regression modeling&#8217; book. It contains plenty of practical advice, as well as S+/R code examples to fit almost anything. I learned quite a bit on logistic regression from here, and his example modeling probability of surviving the sinking of the Titanic is a classic. I use a simplified version of this example when teaching.</li>
<li><a href="http://www.amazon.com/gp/product/0521009766?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0521009766">Experimental Design and Data Analysis for Biologists</a> by Gerry Quinn and Michael Keough. This is one of those &#8217;statistics for biologists&#8217; books, but with a big difference: it assumes that the reader is intelligent. It covers a lot of ground, but always presenting a (relatively) modern approach to design an analysis. It covers statistical power for ecological experiments in a very satisfactory (and clear) way. In my opinion, it is much more useful than <a href="http://www.amazon.com/gp/product/013081542X?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=013081542X">Zar</a>&#8217;s and <a href="http://www.amazon.com/gp/product/0716724111?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0716724111">Sokal</a>&#8217;s &#8217;stats for biologists&#8217; books.</li>
<li><a href="http://www.amazon.com/gp/product/052168689X?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=052168689X">Data Analysis Using Regression and Multilevel/Hierarchical Models</a> by Andrew Gelman and Jennifer Hill. This book&#8211;as Hamilton&#8217;s&#8211;comes from Social Sciences. It is really enjoyable, has interesting examples and tackle modern approaches, including Bayesian Stats. I am currently trying to read it from cover to cover. If one has some background on stats I would recommend starting with this one, as is quickly becoming one of my favorite books on linear models (<a href="http://www.amazon.com/review/RPRMFR3F1S3SW/ref=cm_cr_rdp_perm">my review</a>).</li>
<li><a href="http://www.amazon.com/gp/product/0470009616?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0470009616">Matrix Algebra Useful for Statistics (Wiley Series in Probability and Statistics)</a> by Shayle Searle. I have the original hard cover edition and read it from cover to cover during my Ph.D. I think it gives a very good background if one wants to understand what is going on in ANOVA and regression, particularly when trying to figure out statistical software output (<a href="http://www.amazon.com/review/R209IG6D6501BM/ref=cm_cr_rdp_perm">my review</a>).</li>
<li><a href="http://www.amazon.com/gp/product/0851990002?ie=UTF8&#038;tag=luisapiolaswe-20&#038;linkCode=as2&#038;camp=1789&#038;creative=390957&#038;creativeASIN=0851990002">Linear Models for the Prediction of Animal Breeding Values (Cabi Publishing)</a> by R. (what does the R stand for?) Mrode. I have the first edition, which provides a good&#8211;although a bit terse&#8211;introduction to BLUP (Best Linear Unbiased Prediction) as used in quantitative genetics models.</li>
</ul>
<p>I will keep on expanding this list when I remember other titles.</p>
<p>Updated: 2008-11-05.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2008/11/02/my-favourite-statistics-books/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Without assumptions</title>
		<link>http://quantum.uncronopio.org/2008/03/07/without-assumptions/</link>
		<comments>http://quantum.uncronopio.org/2008/03/07/without-assumptions/#comments</comments>
		<pubDate>Thu, 06 Mar 2008 21:17:24 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[quotes]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/2008/03/07/without-assumptions/</guid>
		<description><![CDATA[I have been working a lot on statistical analyses this week, looking at issues of data relatedness, connectedness between sites and overall data quality measurements. A quote to remember:

Without assumptions there can be no conclusions — John Tukey

]]></description>
			<content:encoded><![CDATA[<p>I have been working a lot on statistical analyses this week, looking at issues of data relatedness, connectedness between sites and overall data quality measurements. A quote to remember:</p>
<blockquote><p>
Without assumptions there can be no conclusions — <em>John Tukey</em>
</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2008/03/07/without-assumptions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mental chaos</title>
		<link>http://quantum.uncronopio.org/2006/09/11/mental-chaos/</link>
		<comments>http://quantum.uncronopio.org/2006/09/11/mental-chaos/#comments</comments>
		<pubDate>Sun, 10 Sep 2006 22:39:43 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[books]]></category>
		<category><![CDATA[miscellanea]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[writing]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/2006/09/11/mental-chaos/</guid>
		<description><![CDATA[Eleven of September again, but I am not thinking of 2001&#8212;which was terrible, I agree&#8212;but of 1973. I always stop for a minute to remember as an old political campaign said: without hate, without fear, without violence. I was six and still remember.
I am preparing lectures, really busy. I took the students after more than [...]]]></description>
			<content:encoded><![CDATA[<p>Eleven of September again, but I am not thinking of 2001&#8212;which was terrible, I agree&#8212;but of 1973. I always stop for a minute to remember as an <a href="http://es.wikipedia.org/wiki/Plebiscito_Nacional_de_1988_(Chile)" title="link to Wikipedia in Spanish">old political campaign</a> said: without hate, without fear, without violence. I was six and still remember.</p>
<p>I am preparing lectures, really busy. I took the students after more than a semester and most of them appear to be clueless. I am going over a general review, again, because they <em>have</em> to learn. Always listening to energetic music when preparing lectures or running analyses. This time is Rammstein&#8217;s &#8216;Sehnsucht&#8217;. But, anyway, I also like to go back a few years and Yes&#8217;s &#8216;Close to the Edge&#8217; (yes, from 1972) is now coming from the headphones.</p>
<p>I start writing most of the ideas in Writeroom and then copy the lot to TeXShop. Packages included in the document preamble for the notes:</p>

<div class="wp_syntax"><div class="code"><pre class="latex latex" style="font-family:monospace;"><span style="color: #8ea1b0; font-style: italic;">%! program = pdflatex</span>
<span style="color: #800000; font-weight: normal;">\documentclass</span><span class="br0">&#91;</span><span style="color: #2222D0; font-weight: normal;">11pt</span><span class="br0">&#93;</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">article</span><span class="br0">&#125;</span>
<span style="color: #800000; font-weight: normal;">\usepackage</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">pslatex</span><span class="br0">&#125;</span>
<span style="color: #800000; font-weight: normal;">\usepackage</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">amsbsy</span><span class="br0">&#125;</span>
<span style="color: #800000; font-weight: normal;">\usepackage</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">graphicx</span><span class="br0">&#125;</span>
<span style="color: #800000; font-weight: normal;">\usepackage</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">geometry</span><span class="br0">&#125;</span>
<span style="color: #800000; font-weight: normal;">\geometry</span><span class="br0">&#123;</span><span style="color: #2222D0; font-weight: normal;">a4paper</span><span class="br0">&#125;</span></pre></div></div>

<p>Although the Statistics course uses Mendenhall &#38; Sincich&#8217;s &#8216;Regression Analysis&#8217; as the basic text, I am using quite a few other references:</p>
<ul>
<li>Quinn and Keough&#8217;s &#8216;Experimental design and data analysis for biologists&#8217;.</li>
<li>Searle&#8217;s &#8216;Linear models&#8217;.</li>
<li>Harrell&#8217;s &#8216;Regression modeling strategies&#8217;.</li>
<li>Steel and Torrie&#8217;s &#8216;Principles and procedures of statistics&#8217; (the first edition!).</li>
<li>Hamilton&#8217;s &#8216;Regression with graphics&#8217;.</li>
<li>Neter and Wasserman&#8217;s &#8216;Applied linear statistical models&#8217;.</li>
</ul>
<p>In addition I am also preparing grant applications, so if you try to contact me please be understanding: I will not read my email (or act upon any emails) until Wednesday next week (around September 20<sup>th</sup>).</p>
<p>Moved to King Crimson&#8217;s &#8216;Discipline&#8217;. Now listening &#8216;Elephant talk&#8217;&#8230;Talk is ony talk&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2006/09/11/mental-chaos/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Current obsessions</title>
		<link>http://quantum.uncronopio.org/2006/03/21/current-obsessions/</link>
		<comments>http://quantum.uncronopio.org/2006/03/21/current-obsessions/#comments</comments>
		<pubDate>Tue, 21 Mar 2006 08:31:29 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[gardening]]></category>
		<category><![CDATA[genetics]]></category>
		<category><![CDATA[miscellanea]]></category>
		<category><![CDATA[photos]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/2005/03/21/current-obsessions/</guid>
		<description><![CDATA[I have slightly changed the focus of my attention during the last month or so. My current obsessions are:

Optimisation of breeding programs, for which I am learning to use AMPL, with Fritz&#8217;s help. I will need to create and format some data to include in some simulations analysed by AMPL and I think I will [...]]]></description>
			<content:encoded><![CDATA[<p>I have slightly changed the focus of my attention during the last month or so. My current obsessions are:</p>
<ul>
<li>Optimisation of breeding programs, for which I am learning to use <a href="http://www.ampl.com" title="A Mathematical Programming Language">AMPL</a>, with <a href="http://www.mang.canterbury.ac.nz/people/jfraffen/">Fritz&#8217;s</a> help. I will need to create and format some data to include in some simulations analysed by AMPL and I think I will use <a href="http://www.python.org">Python</a> to prototype them. If I run in to speed bottlenecks I will reimplement numerically intensive processes in either C++ or Fortran 95.</li>
<li>Genetics of wood properties. After some early forages on wood properties&#8212;which finished when Carolyn changed jobs&#8212;I am back at it. <a href="http://www.fore.canterbury.ac.nz/people/walker.shtml">John</a> has been very welcoming and we are trying to put a couple of projects together. We should have some early results by mid next year.</li>
</ul>
<p>There are a few bits and pieces that do not fall in these two broad areas, but they will converge pretty soon.</p>
<h3>Family-wise</h3>
<p>Working with Marcela and Orlando in the veggie patch. I have never had much of a green thumb, but I am really trying. We sowed coriander, parsley and chervil, and planted <a href="http://www.asiafood.org/bokchoi.cfm">bok choi</a>, onions, dill, lemon balm and capsicum. Apart from the capsicum seedlings that are struggling (a drainage problem is my guess) everything is doing fine.</p>
<p><img src='http://quantum.uncronopio.org/wp-content/uploads/2008/01/marcelaandorlando.jpg' alt='Marcela and Orlando checking worm farm' /></p>
<p>Marcela&#8217;s worm farm is the old-new addition. We used to have a worm farm in Australia, but due to quarantine issues, we decided to leave it there. So we needed to get a new one plus order the first batch of worms by mail.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2006/03/21/current-obsessions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>LIDAR ground truthing</title>
		<link>http://quantum.uncronopio.org/2005/08/01/lidar-ground-truthing/</link>
		<comments>http://quantum.uncronopio.org/2005/08/01/lidar-ground-truthing/#comments</comments>
		<pubDate>Mon, 01 Aug 2005 07:24:41 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[forestry]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/2005/08/01/lidar-ground-truthing/</guid>
		<description><![CDATA[In the previous post I was hopeful that our simple LIDAR data processing would be able to locate the trees, but I did not have any evidence to confirm that what we saw in the computer was not a fluke.
After postponing field work due to bad weather, last Thursday we had a field crew assessing [...]]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://quantum.uncronopio.org/2005/07/21/first-look-at-lidar/">previous post</a> I was hopeful that our simple LIDAR data processing would be able to locate the trees, but I did not have any evidence to confirm that what we saw in the computer was not a fluke.</p>
<p>After postponing field work due to bad weather, last Thursday we had a field crew assessing six GPS located plots. Today David overlayed the plot data, after differentially correcting it, and the position of the trees estimated with LIDAR and ta-da! the trees pretty much match. There are some random differences (of up to 2 meters) in position, but considering that the stem is not necessarily under the top of the tree and that there was quite a bit of data processing and smoothing, the results seem to be extremely promising.</p>
<p>We now need to formally estimate the association between positions, explain any errors of detection (trees in the ground but missing in LIDAR or vice versa) and prepare a short report.</p>
<p>Some times work is sweet, particularly when reality and models match one another.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2005/08/01/lidar-ground-truthing/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>First look at LIDAR</title>
		<link>http://quantum.uncronopio.org/2005/07/21/first-look-at-lidar/</link>
		<comments>http://quantum.uncronopio.org/2005/07/21/first-look-at-lidar/#comments</comments>
		<pubDate>Thu, 21 Jul 2005 07:12:57 +0000</pubDate>
		<dc:creator>Luis</dc:creator>
				<category><![CDATA[forestry]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://quantum.uncronopio.org/2005/07/21/first-look-at-lidar/</guid>
		<description><![CDATA[This has been a very busy and intense week. Deviations from routine are &#8212; at least most of the time &#8212; welcome, and particularly so in this case. 
LIDAR is a fascinating technology, which until last week I always thought to be very overrated. People were always presenting pretty pictures, but nothing really useful. Last [...]]]></description>
			<content:encoded><![CDATA[<p>This has been a very busy and intense week. Deviations from routine are &#8212; at least most of the time &#8212; welcome, and particularly so in this case. </p>
<p><acronym title="LIght Detection And Raging">LIDAR</acronym> is a fascinating technology, which until last week I always thought to be very overrated. People were always presenting pretty pictures, but nothing really useful. Last Friday we had a meeting where we discussed using <a href="http://www.ghcc.msfc.nasa.gov/sparcle/sparcle_tutorial.html" title="link to NASA">LIDAR</a>, aerophotography, <a href="http://www.digitalglobe.com/">Quickbird</a> and field measurements in some native forest to obtain stand level attributes and make thinning decisions. Finally a practical application!</p>
<p>On Monday David, Jamie and I got the text files (with x, y and z coordinates, and intensity). The files were huge, with a total of 150,000 points to describe 10 ha. I first read them with <a href="http://insightful.com">Splus</a> and <a href="http://www.r-project.org">R</a>, which clearly didn&#8217;t like much the size of the matrices. Maybe Splus 7 developer (with the new pipeline architecture) works better, but I do not have a copy yet.</p>
<p>At the same time, I imported the data into <a href="http://mayavi.sourceforge.net/">MayaVi</a> (which uses VTK) using an adaptation of <a href="http://public.kitware.com/pipermail/vtkusers/2003-December/070658.html">this Python script</a> and producing the following visualisation to get a feeling of the site. It is easy to interact with the picture, and one can see the road and the difference between old trees and regeneration. However, it is still pretty useless.</p>
<p><img src='http://quantum.uncronopio.org/wp-content/uploads/2008/01/lidardatainmayavi.jpg' alt='LIDAR data in MayaVi' /></p>
<p>Trying to work with individual (non-grid) points proved to be fruitless. The size of the problem is huge, and trying to use any spatial statistics contained in the R module <a href="http://sal.uiuc.edu/csiss/Rgeo/">spatstat</a> was hopeless. Trying to apply anything that requires a large distance matrix (150,000&#215;150,000) is guaranteed failure.</p>
<p>Meanwhile, Jamie helped David to obtain crown and terrain surfaces using Kriging in <a href="http://www.saga-gis.uni-goettingen.de/html/index.php">SAGA</a>. </p>
<p><img src='http://quantum.uncronopio.org/wp-content/uploads/2008/01/lidarheightsurface.jpg' alt='LIDAR height surface' /></p>
<p>After transforming them in grids, and substracting them to get &#8216;true&#8217; tree heights, David and I came up with a naive (slow but apparently effective) way of finding the top of the trees. After a few problems with the implementation and Simon&#8217;s help using a SQL query in <a href="http://www.manifold.net/">Manifold</a>, we were able to identify the tops and their respective heights. From there to a diameter/height and volume equations there is a small step. </p>
<p><img src='http://quantum.uncronopio.org/wp-content/uploads/2008/01/topoftreesinsmallsection.jpg' alt='Top of trees in small section' /></p>
<p>Now we need to move to ground truthing and thinning strategies.</p>
<p>PS. 2005-08-01. We are processing data from ground truthing and our work is looking even <a href="http://quantum.uncronopio.org/2005/08/01/lidar-ground-truthing/" title="link to Quantum Forest">more promising</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://quantum.uncronopio.org/2005/07/21/first-look-at-lidar/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
