Wednesday, 25 November 2009

Mining The Source Code

In the last post we saw that accusers are willing to quote mine the released CRU emails, selectively taking a choice phrase at face value and missing the preceding and proceeding context in the longer email.

Now we will see them doing similar with some of the released CRU source code. The released source code included source for some of CRU's surface temperature record and source code for some proxy work.  No climate model source code was released as far as I know, although that hasn't stopped many of the accusers rampantly assuming there has been - presumably either confusing or not knowing the difference between temperature records and climate models.

This post concerns the an accusation which is now spread far and wide all over the internet.

Here is one example:

http://esr.ibiblio.org/?p=1447

Here's the code and comments in question:

;
; Apply a VERY ARTIFICAL correction for decline!!
;
yrloc=[1400,findgen(19)*5.+1904]
valadj=[0.,0.,0.,0.,0.,-0.1,-0.25,-0.3,0.,- 0.1,0.3,0.8,1.2,1.7,2.5,2.6,2.6,$
2.6,2.6,2.6]*0.75 ; fudge factor
if n_elements(yrloc) ne n_elements(valadj) then message,’Oooops!’
;
yearlyadj=interpol(valadj,yrloc,timey)

The accusers point to the words "very artificial", "fudge factor" and to the nature of what is being done. 

yrloc is assigned a 20 element array, the first value starts at 1400, the second at 1904 and the rest increment by 5 until 1994. Ie 1400, 1904, 1909, 1914, ... 1994. They are obviously years.

valadj is another 20 element array, you can see the values it is assigned above in the line "fudge factor". The 'Oooops!' message is displayed if the number of elements in the yrloc and valadj arrays are different. They shouldn't ever be according to the code, this line was probably added in as a first pass safety check and not subsequently removed.

yearlyadj=interpol(valadj,yrloc,timey)

I have to guess a little here (I don't know IDL), but I think this is producing an array yearlyadj to hold an adjustment value for every year since 1400, derived by interpolating yrloc over valadj

Despite so many accusers citing this snippet of code, they amazingly fail to mention (or perhaps notice?) that directly following this snippet is:

;filter_cru,5.,/nan,tsin=yyy+yearlyadj,tslow=tslow
;oplot,timey,tslow,thick=5,color=20
;
filter_cru,5.,/nan,tsin=yyy,tslow=tslow
oplot,timey,tslow,thick=5,color=21

The top line contains yyy+yearlyadj. This is the only place where the previously created adjustment array is used, I presume (I don't know IDL, the language used here) that yyy contains each years temperature data and that this is adding the adjustments to the temperature data to produce the plot. But notice at the start of that line is a semi-colon. That line is commented out, inactivated. The lines that are used instead do not contain the use of yearlyadj and therefore do not apply the adjustment, they only plot yyy.

Of course it would be trivial to switch the comments around and activate the adjustment, but as the accusers are relying on a face-value interpretation of the source code they should fall by such silliness.

They haven't even shown their quoted adjustment was used, let alone what it's purpose is. A proper analysis of this would require knowing what the adjustment was based on (it clearly isn't arbitrary), why it was done (perhaps nothing more than an experiment), and not to forget - whether it was even used at all in published results.

It's not difficult for me to point out why the accusations of fraud are misplaced. All I have to do is point out that they have insufficient evidence. Come back with better, if you can. I am surprised they haven't picked up on the mispelt "artifical", surely that beggars belief - true scientists wouldn't spell words wrong! Quick to the blogs!

Isn't it surprising some of the same people who demand so much evidence when faced with the science behind manmade global warming are surprisingly relaxed at placing accusations of fraud with such a dearth of evidence?

50 comments:

dhogaza said...

'I am surprised they haven't picked up on the mispelt "artifical" ....'

Eric Raymond did, the universe has been restored to its proper order :)

kunzang said...

People on both sides are getting confused by the details. It doesn't matter what the code looks like or says, its what it does.

Same with emails. Who cares about their tone of voice, it's their actions that matter.

guthrie said...

Uh huh, and what do their actions show? Why, they wrote stuff up and got it published, as usual, and oddly enough nobody has any evidence to suggest that the published stuff is wrong...

Professor Mandia said...

Keep up the good work. I would love to see more analysis. I check this blog each day.

OBloodyHell said...

> Why, they wrote stuff up and got it published, as usual, and oddly enough nobody has any evidence to suggest that the published stuff is wrong...

Rather difficult when the original datasets the published data are derived from go missing, innit?

hiroantagonist said...

I'm with OBH. If these emails and files were leaked AND the raw data and final processing algorithms were freely available to the public, this would be a non-story or certainly more easily defensible.

But the fact that there was a clear concerted effort to keep the taxpayer-owned material hidden (or ultimately trashed) despite numerous FOI requests causes us all to wonder why. It puts the results into question, and that is fair and objective.

So all that we're asking is to be shown what has not yet been shown -- we've seen the graph, and the data, and the resulting conclusions, but how were the graphs made and how was the data collected and processed?

This is fair. In fact, this is science.

Martin said...

This adjustment is used right here, is completely legit, with the detailed explanation for its use, with dire warnings on its "artificality", in sections 4.3 and 4.4:

http://74.125.93.132/search?q=cache:www.cru.uea.ac.uk/~timo/papepages/pwosborn_summertemppatt_submit2gpc.pdf

hiroantagonist said...

... and then they immediately go on to say: "Though this is a rather ad hoc approach..."

So, we get it -- it's not necessarily bad science or bad data. It's not an out and out hoax of total data falsification. The problem is that it is data worth re-processing and checking and worth *questioning*. Is this the best or only data that makes the case that humans are causing runaway global warming?

Good scientists would welcome and encourage external experts checking their work -- if it checks out, it is all the more solid. But the emails reveal a profusion of evasion, discussions of hiding, and flaunting the FOI requests.

To his credit, Briffa comes off in the emails (most of the time) as one of the ones who cares about credibility, the overstatement of consensus, and public openness. The same cannot be said of Jones, Mann, and Santer.

krabapple said...

The idea that simply putting the 'raw data and final processing algorithms' out there would end the denialist noise, is naive at best and disingenuous at worst.

Consider for starters the non-trivial difficulty of reproducing such analysis exactly:

tp://moregrumbinescience.blogspot.com/2009/11/data-set-reproducibility.html

hiroantagonist said...

It doesn't matter if it's 'non-trivially difficult', krabapple. If the data can't be reproduced by independent researchers that don't have a stake in whether or not it turns out the same way, it's not good science.

I find it deeply ironic that the ONE field of science that might mean catastrophic disaster or the waste of trillions of dollars is the field that is most resistant to basic, indpendent checking of the data. If it can be picked apart by the critics and still remain standing, it's all the stronger. If it can't be, then it deserves to be thrown out. No truly objective scientists wouldn't want that, uncomfortable though it might be.

So is this -- advocating for fair and open science practices -- just a lot of denialist noise? You'd be happy if we just stopped trying to find better, fairer, more modern ways of analyzing the mountains of existing and new data?

Mark Francis said...

Tim Lambert did the same work, and also dug up the related paper:

http://scienceblogs.com/deltoid/2009/12/quote_mining_code.php

RubinVoelker said...

^^ 謝謝你的分享,祝你生活永遠多彩多姿!.........................

吃太飽 said...

路過--你好嗎..很棒的BLOG.........................................

治士士 said...

希望能有更多心得與我們分享~ ....................................................

怡芳 said...

格主的部落格內容真豐富~~看得很開心 ....................................................

上宜俊宇芳心 said...

Judge not of men and things at first sight.......................................................

v奎峰奎峰 said...

thank you for you to make me learn more,thank you∩0∩ ........................................

虹玟 said...

做愛限制級波霸口交18禁貼圖寫真視訊援交露點爆乳潮吹裸體裸照裸女愛愛無碼尋夢視訊聊天a漫a片a圖一夜情一葉情人妻激情情色寫真美女自拍辣妹自拍正妹自拍美女走光辣妹走光正妹走光脫衣秀

俊翔俊翔 said...

非常感謝~3Q~....................................................

淑娟 said...

Nice post ~ 3Q..............................................................

怡潔 said...

Thanks ~ ~ I will come back yo~................................................................

欣怡 said...

憂能傷身,保重哦!.............................................

惠玲 said...

命運,就是自己行為的結果。..................................................

韋成 said...

你不能改變容貌~~但你可以展現笑容..................................................

家vRegg明 said...

you always know the right thing to say!............................................................

歐英傑 said...

不費勞力而得者,唯貧困而已 ..................................................

宛妹宛妹 said...

Many a little makes a mickle...................................................................

曉薇 said...

真是太有道理了~~我支持你~~~.................................................................

張瑋劭 said...

Poverty is stranger to industry.....................................................................

宛真宛真 said...

喜歡你的部落格,留言請您繼續加油.................................................................

淑娟淑娟 said...

當一個人內心能容納兩樣相互衝突的東西,這個人便開始變得有價值了。............................................................

家賢 said...

人有兩眼一舌,是為了觀察倍於說話的緣故。............................................................

柏強 said...

A friend in need is a friend indeed...................................................................

LupeConstan治士 said...

單純喜歡你的部落格 留個言跟您問聲好~~............................................................

國昆 said...

真有內容的部落格~..................................................................

姿柯瑩柯dgdd憶曾g智曾 said...

來看你了~心在、愛在、牽掛在,幸福才會繁衍不息^^...............................................................

蕙春蕙春 said...

blog有留言互動才好玩~~希望留言能支持您的更新動力!!!............................................................

陳韋夏陳韋夏益東富益東富 said...

值得一看再看的格子,多謝分享.................................................................

承蘋承蘋 said...

Learning makes life sweet.

孫佑霖孫佑霖v said...

一棵樹除非在春天開了花,否則難望在秋天結果。............................................................

怡謝伶 said...

很期待新的內容,幫你推個文^^ ............................................................

吳林怡廷佳宇 said...

耐心是一株很苦的植物,但果實卻很甜美。..................................................

燕明中延 said...

愛情是一種發明,需要不斷改良。只是,這種發明和其他發明不一樣,它沒有專利權,隨時會被人搶走。.................................................................

家唐銘 said...

面對光明,陰影就在我們身後!加油哦!............................................................

淑成李李淑成李李 said...

Pen and ink is wits plough...................................................................

思韓韓韓穎 said...

第一忠誠,第二勤奮,第三專心工作。..................................................

48684 said...

人生中最好的禮物就是屬於自己的一部份............................................................

王辛江淑萍康 said...

我是天山,等待一輪明月。......................................................................

劉陳佳生怡陳佳生修 said...

好看耶~一定每天有空給你支持..................................................

灏群 said...

唯有用熱情、用智慧去觀察事物,這事物才會把他的秘密,洩漏給我們......................................................................

Post a Comment