| 
 
 
 
 
 | Sent:
        November 17, 2003 2:11 PM Dear
        Sirs,   We
        have been carrying out a detailed examination of the paper: Mann,
        M.E., Bradley, R.S. & Hughes, M.K. (1998) Global-Scale Temperature
        Patterns and Climate Forcing Over the Past Six Centuries, Nature,
        392, 779-787, 1998
        (“MBH98”). We have consulted the Nature on-line Supplementary
        Information, used two data files sent to us by Professor Mann’s group
        in April 2003 (responding to a request for the FTP location of proxy
        data used in the above study) and, since Oct. 29, 2003, the information
        at the FTP site ftp://holocene.evsc.virginia.edu/pub/.
        This site is not identified through nature.com nor was it given to us
        upon specific request in April 2003, but is now identified by Professor
        Mann as the actual repository of MBH98 data. 
        
          
        
        There
        are a number of points in MBH98 which need substantial clarification,
        some of which could be resolved with through more adequate Supplementary
        Information. We have sought such clarification from Professor Mann
        without success. With reference to the policies stated at http://www.nature.com/nature/submit/policies/index.html,
        in particular item number 6, we are writing to advise you of a
        persistent refusal to comply with the guidelines and other issues. We
        draw your attention to the following issues: 
        
          
        
        1)
          There is a listing in the
        Nature Supplementary Information of 112 proxy series said to have been
        used in MBH98. In April 2003, in response to our request for an FTP site
        where these data could be found, an associate of Professor Mann (acting
        on his instructions) sent us a file (pcproxy.txt) containing 112 data
        series for the entire period back to 1400, together with a file
        apparently identifying these 112 data series. No FTP site containing
        these proxy series was identified to us at that time nor in subsequent
        correspondence with either Professor Mann nor his associate prior to
        October 29, 2003.  
        
          
        
             
        
        In our analysis, we identified nontrivial problems with the data
        in the file we had been sent. Professor Mann’s associate was unable to
        shed any light on them. When we sought additional information from
        Professor Mann, he informed us that demands on his time prevented him
        from considering this or other inquiries.  
        
          
        
             
        
        We published our findings in Energy
        and Environment 14 (2003). Subsequently, on Oct. 29, 2003, Professor
        Mann stated that the errors we found in pcproxy.txt were inadvertently
        introduced into the file at the time it was sent to us, that we failed
        to consult the correct data at his University of Virginia FTP site and
        that our study wrongly used 112 proxy series, whereas MBH98 is based on
        159 series. Upon inspecting the FTP site on October 29, 2003 (the first
        we learned of its existence), we identified the identical file we had
        been sent in April 2003 (pcproxy.txt), together with its Matlab
        predecessor pcproxy.mat, both dated August 8, 2002, well before our
        request. Sometime between October 29 and November 8, 2003, both of these
        files were deleted from Professor Mann’s FTP site. 
        
          
        
             
        
        As to the number of proxies, MBH98 clearly refers to only 112
        series, as does the on-line Supplementary Information. There is no
        reference to the use of 159 proxy series. We have requested a listing of
        these series from Mann. He has responded that the series are located on
        his FTP site and has refused, despite several requests, to provide more
        particulars. The difficulty is that the FTP site contains over 430
        principal component series commencing in periods after 1400, and the relevant
        series are not flagged. The FTP site also includes many proxy series
        seemingly left out of MBH98. This level of disclosure is clearly
        inadequate. The deletion of archived data files in response to a
        controversy is also very disquieting. At a minimum, we believe that
        Nature should archive the 159 series now disclosed to have been used in
        the MBH98 calculations together with a reconciliation of the difference
        between the 159 series stated to have been actually used and the 112
        series reported in Nature. 
        
          
        
        2)
          Some important data
        relating to MBH98, such as temperature principal components and EOFs,
        was located at Professor Mann’s former University of Massachusetts FTP
        site. Indeed, Professor Mann’s webpage on MBH98 contains no link or
        reference to the University of Virginia FTP site, but does contain links
        to this University of Massachusetts FTP site. On November 13, 2003, this
        FTP site was deleted. Important data pertaining to MBH98 formerly
        located at the University of Massachusetts FTP site is not located at
        the University of Virginia FTP site. 
        
          
        
        3)  
        There are numerous discrepancies between the data listing at the
        Nature SI site and that in the newly-disclosed FTP site at the
        University of Virginia. Many of the proxy series used in MBH98 are
        principal components calculated from larger collections of tree ring
        site “chronologies”. The Nature SI site provides listings of the
        sites reported to have been used in these calculations, with rosters
        identified for five regions (Oklahoma, US Southwest/Mexico, North
        America, South America, and Australia) plus a listing of the number of
        series used for a Vaganov (Russian) collection. We have compared these
        listings with the information on the sites actually used at the FTP site
        and found many inconsistencies. For example, in the South American
        region, only 11 of the 18 sites listed in the Supplementary Information
        are actually used in MBH98 calculations, according to the evidence of
        the FTP site. Moreover there is evidence archived at Professor Mann’s
        FTP site of intentional exclusion of a disclosed South American site. An
        archived email from Hughes to Mann, dated July 29, 1997, and located at ftp://holocene.evsc.virginia.edu/pub/MBH98/TREE/VAGANOV/ORIG/malcolm_29-JUL-97
        states that exclusion of the site arge030 would be “better for our
        purposes”. This site arge030 can be shown to be Rio Cisne (through the
        World Data Center for Paleoclimatology) and is listed in the Nature
        Supplementary Information, but is excluded from the actual data in the
        FTP site. 
        
          
        
        4)
          We found multiple versions
        at the FTP site of several series, in which a secondary series is a
        truncated version of the full underlying data. For instance the FTP site
        contains a correct and complete record of the historical annual Central
        England Temperature data and historical annual Central Europe
        Temperature data. Evidence at the site indicates that MBH98 instead used
        summertime averages from these series, in which the first 70 and 25
        years of data, respectively, were intentionally deleted. No explanation
        is available for this, and the Nature SI site does not indicate or
        explain this editing of source data. 
        
          
        
        5)
          MBH98 states that
        “conventional” principal components methods were used in the
        calculation of temperature principal components. The underlying
        temperature dataset contains endemic missing data; indeed, 4 temperature
        grid cells from the UK Climate Research Unit selected by MBH98 contain
        no observations at all. “Conventional” principal components
        algorithms fail with missing data. Accordingly, MBH98 could not have
        used conventional methods and a proper description of the methodology
        actually employed is required. We sought specific clarification on this
        matter from Professor Mann and were refused. Nor is any available at the
        FTP site. Disclosure would be trivially easy (simply posting a command
        file for a statistics package would suffice). Without such information
        it is effectively impossible for an independent researcher to replicate
        his calculations. 
        
          
        
        6)
          We attempted in our Energy
        and Environment article to assess the impact of the data errors and
        quality control defects in the proxy data, which we had identified. For
        the calculation of principal components for the various tree ring
        regions, we used a standard principal components algorithm (princomp in
        R), which provides values only for periods in which there are no missing
        values. In response, MBH98 have stated that they used something called a
        “stepwise principal components approach”, a phrase which is not used
        in the Nature article or Supplementary Information. Based on their
        recent comments, this “approach” apparently involves changing the
        proxy rosters at some points in the PC series, leading to different
        values of the regional PC depending on the period being analyzed.
        Whatever the merits of the approach, there is no disclosure of it in
        MBH98, leaving the reader uninformed about the existence and formulae of
        a fundamental methodology as well as such basic details as the number
        and composition of principal components used in each region on a
        period-by-period bases. We have requested information from Professor
        Mann allowing us to identify the PC proxy rosters on a period-by-period
        basis and were refused. He deems it sufficient to merely point to his
        FTP site, but having examined this site we are unable to uniquely
        identify the “stepwise” PC rosters or the procedures used to splice
        different series together. 
        
          7)  
        MBH98 states that for the temperature data  “the
        mean was removed, and the series was normalized by its standard
        deviation”. This is, therefore, the method we employed. In their reply
        to our article, MBH criticized us on the grounds that they actually used
         “de-trended gridpoint
        standard deviations” to normalize temperature data. Whether the
        difference is material or not, the fact that their employed methodology
        contradicts the description in
        MBH98 means that independent researchers are unable to replicate their
        work. 
        
          8)  
        Prior to the publication of our article, we  requested
        other particulars on the computational methodology from Professor Mann
        and were refused. Accordingly, we attempted to assess the impact of the
        data problems by following the methodology publicly disclosed in MBH98.
        Professor Mann then criticized us for failing to replicate previously
        undisclosed details of his methodology. We once again requested
        particulars on his methodology, including copies of the computer
        programs used to read in the proxy and temperature series and to produce
        the Northern Hemisphere temperature index—but we have been
        categorically refused.   The
        policies of Nature rightly place a burden on authors to disclose data
        and methods to any interested readers. We have been systematically and
        deliberately stymied by Professor Mann on the most elementary requests:
        a proper listing of his data series and the exact computational
        procedures used. In the process of trying to obtain this information we
        have concluded that the disclosure at the Nature SI site is not merely
        inadequate, but in some cases it contradicts what is now revealed at the
        University of Virginia FTP site.    Under
        the circumstances, we believe that the full data set and accompanying
        programs for MBH98 should now be included in the Nature Supplementary
        Information, along with an accounting of any discrepancies between what
        has been listed at Nature.com to date and what was actually used in
        MBH98.   We
        can make copies of all correspondence and our extensive notes on the
        data issues available for your inspection if needed. Thank you for your
        consideration.   Yours truly 
 Stephen McIntyre Toronto, Canada 
 Ross McKitrick University of Guelph Guelph,
        Canada  
 |