I.“ a II. 39.... n. E ‘1‘. Will .. .i l... . , ....I x. . 7 .c :3 L .01... .u via: .5. 1.. . 1...... , a. .6... .l...$..$ 3.... 1.2)) . 74.3.1... .3 an. .A: 4. fig??? . , Ewiwvn. smifl..h..x . $.....4§..mw? a. . ivfinuxmrs 1.5.1... . ..r 3...}... I)... a... 1... D 1...... t. a L. 5):, 120'} 1.9.2.4.: i... i... i... 1.. 523;: :35 . u u. .35.. ...§.fi.w..». .13 3.13... w, .. mad. I. V . .1. G h , "Mask“. 3|. h. 2 an. xs £- 3. . A". z... .8... 0|...» .1. .r! 133.4“- '7 .k s ‘4 ..... .. lul‘ ”1:34-: .1: .. p 3.24:)..." is \..r .r’...L.... WHWJWQ LIBRARY Michigan State University AI/TN é/éjgai" .’ 9’ This is to certify that the thesis entitled COMPARATIVE ANALYSIS OF FOREST CLASSIFICATION IN FOREST MANAGEMENT INFORMATION DATABASES IN MICHIGAN presented by Nirmal Subedi has been accepted towards fulfillment of the requirements for the Master of Science degree in Forestg pig/W1 a. flag“. MajSiProfessor’s Signature Auqufl" I8, 2005' d . Date MSU is an Affinnative Action/Equal Opportunity Institution ..-.—.--.—--- no--o.—---u---o-q-I-o-u--u.-a-o--u-.-------o-a-n-n--n-c-a-1-.-.-o-n-n-n-o-I-c-o-o-o-n-n-a-n-I--- PLACE IN RETURN Box to remove this checkout from your record. To AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 0 ES 5 Z 5 ‘E‘m 2/05 c:/CIRC/DateDus.lndd-p.15 —_‘_. CONIPARATIVE ANALYSIS OF FOREST CLASSIFICATION IN FOREST MANAGEMENT INFORMATION DATABASES IN MICHIGAN By Nirmal Subedi A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Department of Forestry 2005 ABSTRACT COMPARATIVE ANALYSIS OF FOREST CLASSIFICATION IN FOREST MANAGEMENT INFORMATION DATABASES IN MICHIGAN By Nirmal Subedi In Michigan, there are four primary sources of forest management information for public forest lands, namely a raster land-cover map (IFMAP), Forest Inventory and Analysis (FIA) plot-level information, Natural Resource Information System Field Sampled Vegetation (NRIS-FSVeg) for national forest lands, and Operations Inventory (01) for state-owned forest lands. The objective of this study is to compare forest classifications between and among the forest management databases with FIA data as the reference location for comparison. Difference matrices were created between and among forest classifications and descriptive accuracy assessments for overall accuracy, producer’s accuracy and user’s accuracy were computed. The overall accuracy of IFMAP with FIA as reference was 63.6% for state forest lands and 64.8% for national forest lands. Overall accuracy of IFMAP with 01 as a reference was 60.3% and IFMAP with NRIS-FSVeg as a reference was 68.3%. Overall accuracy of OI with FIA was 84.5% and NRIS-FSVeg with FIA was 82.2%. Overall accuracy of three-way forest classification was 54.8% and 58.5% for state and national forests lands, respectively. Kappa statistic, calculated from three approaches, ranged from 0.568 to 0.628 for state forest lands and 0.555 to 0.612 for national forest lands. This finding is consistent with a previous study of IFMAP. Copy right by NIRMAL SUBEDI 2005 ACKNOWLEDGEMENTS This study is a collaborative effort Of the Department Of Forestry, MSU, the Michigan Department of Natural Resources, the national forests of Michigan, and FIA unit, NCRS. Many people working in different organizations contributed significantly for the successful completion of this thesis. Committee members, Dr. David MacFarlane and Dr. Rique Campa provided valuable contributions in improving the quality of work. Dr. Larry Pedersen, another committee member, was very instrumental for creating amicable environment for inter-agency collaboration and providing empirical understanding of the problem. In particular, my advisor, Dr. Larry A. Leefers, guided me in the MS program, provided generous support for better understanding the research problem, resolving the issues during data analysis and improving the quality of research work. Similarly, Mr. Geoff Holden, NCRS, played a very crucial and cooperative role to accomplish this study on time. Including my committee members, Dr. Donald Dickmann, Dr. Mark Hansen, Mr. Jason Stephens and Mr. Joseph Gates contributed significantly to improve the crosswalk tables for forest classifications. MSU Faculty Dr. Steven Friedman, Dr. Kelly F. Millenbah and Dr. Murari Suvedi were generous in providing valuable inputs for my research as well. The fellow colleagues at MSU particularly Eric White, Anita Morzillo, Qing Xiang and Murari Regmi were very helpful to me while learning new skills required for this research. I would like to thank Fulbright Scholarship for providing an excellent Opportunity to study MS degree at Michigan State University. Special thanks are due to my spouse Manjana, my daughter, Swastika for their patience and understanding and support. Without their cooperation, it would have been impossible to accomplish my goal. iv TABLES OF CONTENTS LIST OF FIGURES ................................................................................... v CHAPTER 1 INTRODUCTION ..................................................................................... 1 Michigan Statewide Raster Map or IFMAP ................................................ 3 Operational Inventory .......................................................................... 5 Field Sampled Vegetation of Natural Resource Information System (NRISFSVeg) ................................................................................... 6 Forest Inventory Analysis (FIA) ............................................................ 10 Problem Statement ............................................................................ 12 Study Purpose .................................................................................. 13 Organization of Thesis ....................................................................... 14 CHAPTER 2 LITERATURE REVIEW .......................................................................... 16 Land Use Classification ......................................................................................... 16 Michigan Resource Inventory System ...................................................... 21 National Land Cover Database (NLCD) for United States .............................. 22 IFMAP Classification ......................................................................... 23 Forest Classification of FIA/OI/NRIS-FSVeg ............................................ 25 FIA Sampling ................................................................................. 26 Digital Imagery Classification ............................................. . ................. 29 Remote Sensing Based Cover Type Mapping Studies .................................. 35 Accuracy Measurement Methods Used for Remote Sensing Imagery Classification. . . ......................................................................................................... 41 CHAPTER 3 DATA SOURCES, STUDY AREA, CLASSIFICATION APPROACHES AND ANALYSIS METHODS ..................................................................... 51 Data Sources ................................................................................... 51 Study Area ..................................................................................... 53 Classification Approaches ................................................................... 54 Analysis Methods ............................................................................. 55 Chapter Summary .............................................................................. 63 CHAPTER 4 RESULTS AND DISCUSSION .................................................................. 64 Using FIA Forest Types as Reference Classification .................................... 68 Accuracy Assessment for State Forest Lands ............................................. 72 Accuracy Assessment for National Forest Lands .......................................... 77 Three-way accuracy of FIA/OI/IFMAP Assessment in the State Forest Lands ..... 85 Three-way accuracy of FIA/NRIS-FSVeg/IFMAP Assessment for the National Forest Lands ................................................................................... 88 Comparison of Classification Agreement Between IFMAP and FIA for State Forest Lands ................................................................................... 91 Comparison of Classification Agreement Between IFMAP and FIA for National Forest Lands ....................................................... . ........................... 98 Explanation of Differences in Classification of IFMAP, 01 with FIA ............... 104 CHAPTER 5 SUMMARY AND CONCLUSION ............................................................. 110 Conclusions ................................................................................... 1 12 Limitations ................................................................................... 1 13 Policy Implications .......................................................................... 115 Additional Research. . . ............................................................................................ l 16 REFERENCES CITED .......................................................................... 117 APPENDIX A A1. NRIS—FSVeg FIA Crosswalk ....................................................... 125 A2. IFMAP-FIA Crosswalk ............................................................. 126 A3. OI-FIA Crosswalk ................................................................... 127 A4. IFMAP-OI Crosswalk ............................................................... 128 A5. IFMAP-NRIS-Veg Crosswalk ..................................................... 129 vi LIST OF TABLES Table 1. Forest Classification of Michigan Statewide Raster Map (IFMAP) ................. 5 Table 2. Forest Type Classification in Operational Inventory .................................. 7 Table 3. Forest Types Used in Michigan National Forest ....................................... 9 Table 4. FIA Forest Type Classification ......................................................... 12 Table 5. Anderson et a]. (1976) Land Use Classification ...................................... 17 Table 6. Comparison of Forest Management Information Database in Michigan ......... 65 Table 7. Difference Matrix of IFMAP and FIA for State Forest Lands ..................... 73 Table 8. Difference Matrix of OI and FIA for State Forest Lands ........................... 74 Table 9. Difference Matrix of IFMAP and 01 for State Forest Lands ....................... 75 Table 10. Difference Matrix of IFMAP and FIA for National Forest Lands ............... 78 Table 11. Difference Matrix of NRIS-FSVeg and FIA for National Forest Lands. . . .80 Table 12. Difference Matrix of IFMAP and NRIS-FSVeg for National Forest Lands. . ..82 Table 13. Three-way accuracy comparison of FIA, 01 and IFMAP for State Forest Land .......................................................................................... 86 Table 14. Three-way accuracy comparison of FIA, NRIS-FSVeg and IFMAP for National Forest Land ................................................................................ 89 Table 15. Error Matrix of IFMAP-FIA for State Forest Land (UA constant) ............... 93 Table 16. Error Matrix of IFMAP-FIA for State Forest Land (PA constant) ............... 94 Table 17. Error Matrix of IFMAP-FIA for State Forest Land (Aggregated) ................ 95 Table 18. Individual Error Matrix Kappa Analysis Results for State Forest Land ......... 96 Table 19. Kappa Analysis Results for Pairwise comparison of Error Matrices ............ 98 Table 20. Error Matrix of IFMAP-FIA for National Forest Land (UA constant) .......... 99 vii Table 21. Error Matrix of IFMAP-FIA for National Forest Land (PA constant) ......... 100 Table 22. Error Matrix of IFMAP-FIA for National Forest Land (Aggregated) .......... 101 Table 23. Individual Error Matrix Kappa Analysis Results for National Forest Land... 102 Table 24. Kappa Analysis Results for Pairwise comparison of Error Matrices ........... 102 viii LIST OF FIGURES FIGURE 1. National Forest and Digitized State-Owned Forest in Michigan .............. 52 FIGURE 2. Illustration of an Error Matrix or a Confusion Matrix ........................... 57 ix CHAPTER 1 INTRODUCTION Forest resources are increasingly important in providing ecosystem stability, human recreation and other goods and services. Information about forest resources is needed to meet the demand for goods and services from forests, whether in Michigan or elsewhere. “Michigan’s forests are making important contributions to the quality of life by providing a wide array of benefits including wildlife habitat, biological diversity, outdoor recreation, and improved air and water quality. Economic contributions are also significant—an estimated $12 billion of value added and 200,000 jobs annually are supported by forest based industries/ tourism/ recreation” (Schmidt et al. 1997). Michigan forestland area totals 19.3 million acres, and timberland totals 18.7 milliOn acres (MDNR, 2002, Leatherberry et al. 2005). The state timberland acreage is the sixth largest in United States exceeded only by the states of Oregon, Georgia, Alabama, Montana and North Carolina (Smith et al. 2004). Management of forest resources requires information about the resources, such as their location, forest type and other characteristics. In addition, demographic, social and economic information are important to assess demand. Nowadays, publics are demanding a variety of goods and services from forestlands, and multiple use management is the main principle for providing such demands. Economic multiple-use forestry planning is a relatively challenging task and it requires information about resources and their demand. In this way, multiple use forestry planning has enhanced the importance of spatial information of forest stand characteristics, as it compares the economic benefits from joint production and specialized production. This thesis focuseson spatial information regarding forest type as defined by various information databases. Forest type information is one of the most valuable pieces of information for forest managers and planners. It can reveal the types of vegetation that are growing which relates to information about the potential production of goods and services at a particular location. In Michigan, four primary sources provide information for forest types available on public lands. However, they sometimes provide conflicting information. The first is a raster map based on remote sensing satellite imagery information, locally named Integrated Forest Monitoring Assessment and Prescription (IFMAP); it provides . information statewide (PMR, 2001a). The second is Operations Inventory (OI) developed by the Forest Minerals and Fire Management Division of the Michigan Department of Natural Resources (MDNR) which provides information for state-owned forestlands. The third is the Field Sampled Vegetation of the Natural Resource Information System (NRIS-FSVeg), developed by the United States Department of Agriculture-Forest Service (USDA-PS) which has information for national forests. The other information source is the sampled plot information inventoried by the USDA-F8 Forest Inventory and Analysis (FIA) staff. For every FIA plot on state and national forests, there is a corresponding IFMAP pixel (raster) and either an 01 (MDNR) or NRIS-FSVeg (USDA-FS) classification. Michigan Statewide Raster Map or IFMAP IFMAP is a statewide map of land cover over all ownerships within the state of Michigan. It also provides a means for the MDNR to develop management prescriptions for their lands. Only the former is used in this research. Basically, this map is derived from Landsat Thematic Mapper 5 and 7 Enhanced Thematic Mapper plus (ETM +) with some ground verification. There are two separate maps, one for the Upper Peninsula and the other for the Lower Peninsula. This is a coarse resolution map for producing a land cover map and dataset, which can serve multiple functions (MDNR 2003b, MDNR 2003c). It is anticipated that this map will provide a resource for ecosystem—scale management and a statewide planning tool for wildlife habitat (PMR, 2001a). In addition, the pixel-level information of land cover can be integrated to assess the resources on a unit or county-wide scale. Each pixel has spatial resolution of 30 meters X 30 meters (PMR, 2001a). The inventory module of IFMAP records information on present land cover, and the activity tracking module records information on treatments and disturbances (PMR, 2001b). These two types of information are kept in separate GIS map layers. In IFMAP, a proposed treatment polygon can be a subset for a land cover stand or even from several adjacent stands. The percentage of canopy occupied by a given species was used for rules to derive land cover and association categories (PMR, 2001a). The percentage of canopy cover is determined by a sensing mechanism, either satellite-home or airborne. Because it is overhead, there is an extremely limited capability for canopy penetration; below-canopy data is not used in this classification. The classification schemehas been examined by representatives of all regions of the state, and resulting changes/additions made help account for the variety of species, species associations, and land cover types found in Michigan. Land use in IFMAP is classified into eight land-use classes: Urban, Agriculture, Upland Open land, Upland Forest, Lowland Forest, Non-forested Wetland and Bare/Sparsely Vegetated. In the Upland classifications lands not periodically flooded or not on hydric soils are included, and Lowlands refer to lands that are periodically flooded and/or on hydric soils. Land is classified as forest when the canopy cover exceeds the 25% of the ground. The algorithm for the forest type classification can be assessed via the Michigan Geographical Data Library website (MDNR 2003a). IFMAP has classified forestland into 12 different forest types in the Lower Peninsula and into 11 different forest types (excludes other upland deciduous) in the Upper Peninsula (Table 1). There are 20 other non-forestry land use classifications to cover the entire State. Thus, IFMAP facilitates statewide identification of the land-use within 32 land-use categories (for details see, MDNR, 2003e). Table 1. Forest Classification of Michigan Statewide Raster Map (IFMAP) a Forest Land Use Class Forest Type Grid Value Upland Deciduous Northern Hardwood Association 14 Forest Oak Association . 15 Aspen Association 16 Other Upland Deciduous 17 Mixed Upland Deciduous 18 Upland Conifer Forest Pines 19 Other Upland Conifers 20 Mixed Upland Conifers 21 Upland Mixed Forest 22 Lowland Forest Lowland Deciduous Forest 24 Lowland Coniferous Forest 25 Lowland Mixed Forest 26 Source: Michigan Geographic Data Library, 2003 (http://www.mcgi.state.mi.us/mgdl/) a Non-forested land with lower canopy class (less than 25%) include Herbaceous openland, Upland and lowland shrub / Low density trees, Mixed non-forested wetland and Other bare/ sparsely vegetated. Operations Inventory Operations Inventory (01) is an inventory system developed by the Forest, Minerals and Fire Management Division (FMFMD) of the MDNR. OI locates and identifies physical, biological, economic, and social information on each unit of land (MDNR, n. d.). OI is expected to provide operational level information related to resource management issues regarding timber, wildlife, forest recreation, water quality, reforestation and land use. It provides the descriptive information at the stand level (the smallest record keeping unit) and a plan of operation for the stand after a multidisciplinary review of a preliminary prescription. Within OI, stands are grouped into compartments (group of stands with average area of 1500 to 3000 acres) based on proximity, common access, landform and soil properties and uniformity on distribution of major cover type acreages (MDNR, n. d.). The 01 definition of forest is very similar to FIA, it identifies a stand as a forest when it has at least 16.7% stocked and land capable of producing 20 cubic feet per year (Pedersen, pers. comm.). Compartments are grouped by year-of-entry (YOE). The 01 analysis proceeds by compartment within a given YOE. Each YOE contains approximately 10 percent of the compartments in a forest area (MDNR, n. d.). By the end of Fiscal Year 2004, the inventory of compartments with YOE 2003, 2004, 2005 and 2006 was completed, and the boundaries of compartments and stands were digitized using a geographic information system (GIS). The inventory of whole forest area owned by MDNR will be completed in six more years. This research is limited to the compartments within these four YOE. The 01 has classified the forest type (referred to cover types) into 16 categories (Table 2). Field Sampled Vegetation of Natural Resource Information System (NRIS- FSVeg) The USDA-FS’s Field Sampled Vegetation of Natural Resource Information System (NRIS-FSVeg) combines a standard corporate database and computer applications designed to support field-level users. NRIS databases contain basic natural resources data in standard formats built to run within the Forest Service computing environment (NRIS FSVeg, 2005), and it provides agency personnel with the information needed to respond to public concerns and to address complex issues. Basically, it provides a diverse range of basic and calculated information in standard formats that can be shared across administrative and ecological boundaries. Table 2. Forest Type Classification in Operations Inventory 8 Cover Type Description Short Description A ASPEN (UPLAND) Aspen B PAPER BIRCH Paper Birch C CEDAR Cedar E SWAMP HARDWOODS Swamp Hrdwds SPRUCE-FIR (UPLANDS-INCLUDING UPLAND BLACK F SPRUCE) Spruce Fir H HEMLOCK Hemlock I LOCAL USE Local Name J JACK PINE Jack Pine M NORTHERN HARDWOOD Upland Hdwds O OAK Oak BALSAM POPLAR & SWAMP ASPEN and SWAMP WHITE P BIRCH Lowlnd Poplr Q MIXED SWAMP CONIFER Mx Swmp Cnfr R RED PINE Red Pine S BLACK SPRUCE-SWAMP Black Spruce T TAMARACK Tamarack W WHITE PINE White Pine Source: Operations Inventory Manual, MDNR (Unpublished) “ Non-forest classes include Tree bog, Grass, Rock, Lowland brush, Marsh, Upland brush, Bog or Muskeg, Other non-stocked or non-forest or non-productive land, Sand dunes, and Water. Field Sampled Vegetation (FSVeg), one of the resource modules of NRIS, provides guidelines for National Forest management planning. This component includes point and plot vegetation data from field surveys. Data on trees, surface cover, understory vegetation and down woody material are included in this component. The USDA-FS defines Forest land as “[l]and at least 10 percent stocked by forest trees of any size, including land that formerly had such trees cover and that will be naturally or artificially regenerated. Forest land includes transition zones, such as areas between heavily forested and nonforested lands that area at least 10 percent stocked with forest trees and forest areas adjacent to urban and built-up lands. The minimum area of classification for forest land is 1 acre. Roadside, streamside and shelterbelt strips of trees must have a crown width of at least 120 ft to qualify as forest land. Unimproved roads and trails, streams, and clearings in forest areas are classified as forest if less than 120 ft wide” (Smith et al. 2004). NRIS conducts ongoing strategic inventories, tactical inventories and stand examinations. The strategic inventory scale is national, the tactical inventory scale is for the Region or Forest, and the stand examination scale is for a project area, stand or vegetation condition. Basically, the stand examinations are one "type" of inventory conducted by the USDA-PS. Its purpose is to obtain the site and setting characteristics required to identify stand conditions and capabilities. The information may be collected by simple observations, or by formal, intensive examinations. The appropriate method to be chosen depends on factors such as stand complexity, the decisions to be made, and the purpose of the exam. Each examination method has varying data and accuracy requirements. However, stand examination data provide information for a wide variety of uses ranging from determining silvicultural treatments to evaluating wildlife habitat and modeling water yields. Integration of the stand level information, based on inventory design, produces national forest planning information. Without knowing the compatibility of scope, scale, and objective, there is a risk of introducing bias into a combined data set (NRIS FSVeg 2004). However, the design considerations and data acquisition guidelines prepared by NRIS are conceptual for national forest management planning rather than directional. The Ottawa, Hiawatha and Huron-Manistee national forests are located in Michigan. The forest cover classification used in the NRIS-FSVeg database of the national forest includes 60 forest cover types and has separate codes for each class (Table 3). Table 3. Forest Types Used by Michigan National Forests Code Forest type Code Forest type 1 Jack Pine 55 Northern Red Oak 2 Red Pine 57 Scarlet Oak 3 White Pine 59 Mixed Oak 4 White Pine-Hemlock 60 Oak-Hardwoods 5 Hemlock 63 Northern pin Oak 6 Scotch Pine 70 Sugar Maple-Black cheery 7 Norway Spruce 71 BI Ash-Elm-R Maple 8 White Spruce 74 White Ash 9 Conifers 76 Red Maple(Wet) 10 Spruce 79 Mixed Lowland Hdwd ll Balsam Fir-Asp-PB 80 Sugar-Maple-northern red oak 12 Black Spruce 81 S Maple-Beech -YB 14 Northern Wh Cedar 82 S Maple-Basswood 15 Tamarack 83 BI CH-W Ash-Y Pop 16 Wh-Sp-BF 84 Red Maple(Dry) 17 Upland BI Spruce 85 Sfiar Maple 18 Mix Swamp conifer 86 Beech - l9 Cedar-Aspen-PB 87 Sugar maple-beech/yellow 20 Northern Hdwd-Heml birch/red spruce 21 Mixed Northern Hdw 88 Black Locust 22 N. White Cedar- UP 89 Mixed Upland Hdwd 23 W. Spruce-BF-Aspen 90 Sugar maple-beech! basswood 24 Balsam Fir 91 Quaking Aspen 41 Wh Pine NRO-W Ash 92 Paper Birch 42 E. Red cedar Hardwood 93 Bigtpoth Aspen 43 Oak-Eastern white pine 94 Balsam Poplar 47 Oak-Aspen 95 Asp-W Spruce BF 48 Jack Pine-Oak 97 Lowland Brush 49 Red Pine-Oak 98 Upland Brush 53 Black Oak 99 Open 54 White Oak Source: NRIS FSVeg Data Dictionary Version 1.7 January 2005, Appendix (pp E6 and E-8) Forest Inventory Analysis (FIA) The primary objective of FIA is to determine the extent, condition, volume growth, and depletions of timber on the Nation’s forestland (Miles et al. 2001). The FIA’s continuing endeavor is mandated by Congress in the McSweeney-McNary Forest Research Act of 1928 and the Forest and Rangeland Renewable Resources Planning Act of 1974. Further, the 1998 Farm Bill required FIA to collect data on 20 percent of plots annually within each State (Miles et al. 2001). FIA defines forest areas as “[l]and at least 16.7 percent stocked by forest trees of any size, or formerly having had such tree cover, and not currently developed for nonforest use. The minimum area for classification of forest land is one acre. Roadside, streamside, and shelterbelt strips of timber must have a crown width of at least 120 feet to qualify as forest land. Unimproved roads and trails, streams or other bodies of water or clearings in forest areas shall be classified as forest if less than 120 feet wide” (Hahn and Hansen 1985). “The FIA inventory is based on aerial photo and/or remote sensing activity used to characterize the acreage of forest and non-forest land in the US. These classes are based on land-use. For forested land, more detailed classes are sometimes defined based on criteria such as forest type, volume per acre, stand size, stand density, ownership and/or stand age” (Miles et al. 2001). Then, ground plots are measured to adjust the remote sensing sample for changes since its data acquisition and to correct any rrrisclassification. "The remote sensing classification of these ground plots, together with the area estimates 10 from the remote sensing sample is used to assign area expansion factors to all ground plots” (Miles et al. 2001). These area expansion factors are used to weigh plot-level estimates when computing estimates for selected strata of the population. FIA plots are designed to cover a 1-acre sample area; however, not all trees on the area are measured. Recent inventories use a national standard, fixed-radius plot layout for sample tree selection. Ground plots may be new plots that have never been measured during a previous inventory. For all plots several observations are recorded for each sample tree, including its diameter, species and other measurements that enable the prediction of the tree’s volume, growth rate, quality, and forest health data. These tree measurements form the basis of the data on the tree records in the FIA database. According to the sixth FIA inventory online information, there were 45 forest types reported as classified by field crew for the public forest land in Michigan (Table 4). A variety of tools like maps, aerial photographs/imagery, and Global Positioning System (GPS) units are utilized to properly install the ground plots (Burkman 2005). Once a ground plot location has been selected on an aerial photograph, it is established and measured in the field. On all forested field plots, quantitative and qualitative measurements are made for conditions such as tree diameter, length, damage, amount of rotten or missing wood and tree quality, tree regeneration, site quality information, stocking, and general land use. And the general stand characteristics are gathered for forest type, stand age and disturbance, change in land use, general stand characteristics and estimates of growth, mortalities, and removals are gathered (Burkman 2005). ll Table 4. FIA Forest Type Classification Code Forest Type Code Forest Type 101 Jack pine 515 Chestnut oak / black oak / scarlet oak 102 Red pine 519 Red mafle / oak . 103 Eastern white pine 520 Mixed upland hardwoods 104 White pine / hemlock 700 Elm/ Ash / Cottonwood Group 105 Eastern hemlock 701 Black ash / American elm / red maple 121 Balsam fir 703 Cottonwood 122 White spruce 704 Willow 125 Black spruce 706 Sugarberry / hackberry / elm / green ash 126 Tamarack 708 Red maple / lowland 127 Northern white-cedar 709 Cottonwood / willow 380 Exotic Softwoods Group 800 Maple / Beech / Birch Group 381 Scotch pine 801 Sugar maple / beech / yellow birch 383 Other exotic softwoods 802 Black cherry 400 Oak Pine Group 803 Cherry / ash / yellow-poplar 401 White pine / Red oak / white ash 805 Hard maple / basswood 409 Other pine / hardwood 807 Elm / ash / locust 500 Oak Hickory Group 809 Red maple / upland 503 White oak / red oak / hickory 900 Aspen / Birch Group 504 White oak 901 Aspen 505 Northern red oak 902 Paper birch 507 Sassafras / persimmon 904 Balsam poplar 509 Bur oak 999 Non-Stocked 513 Black locust Problem Statement When there are multiple sources of forest resource information many users of these information sources try to make comparisons to meet their policy, planning and management needs. However, the adoption of different definitions for forest land and forest types makes it difficult to infer similar conclusions in some cases. For example, FIA and 01 define forest area as land at least 16.7 percent stocked by forest trees of any size or formerly having had such tree cover, and not currently developed for non forest use. IFMAP defines forest as land with the proportion of crown cover exceeding 25 % of the land area (MDNR 2003d). The difference has resulted in classification of a number of 12 non-forest types that are not delineated in FIA (Pedersen 2002). The 01 and IFMAP have non-forest types like T reed bog, Grass, Rock, Lowland/Upland Brush and Marsh. The other distinction between the IFMAP and NRIS-FSVeg/OI is that the former will identify a recent clear cut as the non-forest while the latter will consider it as forest. This is problematic when only a single date of imagery from all medium- to coarse-resolution sensors in a landscape where agriculture and forestry are interwoven (Wynne et al. 2000). IFMAP has classified forest land into the least number of forest types followed by 01 which has 16 categories. NRIS-FSVeg is classified into 60 types, and FIA has 45 forest types in public forest land in Michigan. FIA plot data can be viewed as the most precise classification scheme given rigorous data collection procedures at the plot level. Other schemes are used, but their relationship to FIA classifications is largely unknown. There are no studies which compare the IFMAP, OI and NRIS—FSVeg classifications with FIA classifications in Michigan. Taking FIA plot location as the point of reference for comparisons, I will provide information on the sirrrilarity and dissimilarity between forest classification approaches. The comparison will be helpful in explaining discrepancies in total public land acreage of different forest types computed from these data bases. Results from a comparative forest classification study may be useful developing recommendations to harmonize the available databases. Study purpose The purpose of this research is to compare classifications of forest types between forest resource databases for public lands in Michigan. The general research questions are: 13 “What are the classifications made by the four data sets? And how consistent are the four classification systems?” This study addresses the following objectives: 0 To compare and contrast the four forest resources inventory/database designs in forest type classification (FIA, IFMAP, 01 and NRIS-FSVeg), 0 To examine consistency among FIA, NRIS-FSVeg, and IFMAP classifications on national forest lands, 0 To examine consistency among FIA, 01, and IFMAP classifications on state forest lands, 0 To compare the agreement of IFMAP and FIA classifications on national and state forest lands, and 0 To explain differences in classifications using plot and stand level characteristics for the state forest lands. Organization of Thesis The thesis has five chapters. A synopsis of each chapter is presented in this section. Chapter 2, Literature Review, exarrrines previous studies on forest classification compared against ground truth plots. Previous studies comparing the accuracy of classification of satellite imagery and forest classifications are reviewed. Studies particularly relevant to Michigan land use are also reviewed. 14 Chapter 3, Classification Approaches, Data Sources, Study Area and Analysis Methods, presents background information on forest classifications for FIA, NRIS-FSVeg, 01, and IFMAP. Details on data sources regarding spatial locations and data attributes are summarized. Methods used to assess accuracy of the various information sources with reference to FIA sample plot information are described. Specifically, construction of error matrices, errors of omission, errors of commission, and Kappa statistics are the major tools used to classify the accuracy of the databases. Chapter 4, Results and Discussion, presents analysis of the error matrices and IFMAP, 01 and NRIS-FSVeg accuracy in classifying forest types on the publicly-owned forestland. Major findings, strengths and weaknesses of the comparisons and data limitations are discussed. In Chapter 5, Summary and Conclusions, policy, planning and management implications and several additional issues on forest classification are discussed. Among them are the definition of the forest area and inclusion of the non-forested land in classification systems. Also, policy recommendations to reconcile the forest classification in public forest land in Michigan are discussed. Additional research ideas are presented. 15 CHAPTER 2 LITERATURE REVIEW This study compares the forest type classification among the forest management planning databases of national forest, state-owned forest and the coarse-resolution state map taking the exact FIA plot location as the reference for comparison. The previous chapter introduced major public forest management databases in Michigan. This chapter reviews the land use classification systems broadly, the Michigan Resource Inventory System (MIRIS), the National Land Cover Database (NLCD) for the United States, forest classification of FIA/OI/ NRIS-FSVeg, FIA sampling, forest classification of IFMAP, digital imagery classification, IFMAP imagery classification, Remote Sensing (RS) based forest cover type studies, and studies related to accuracy measurement of RS imagery classification. Land Use Classification Existing use of land is the main characteristic for defining land use. Anderson et al. (1976) pioneered land use classification from remote sensing information and outlined criteria to effectively utilize the remote sensing information for land use and land cover classification. The land use classification developed by Anderson et al. (1976) has level I and level H classifications (Table 5). Information at levels I and H are a basis for national level or statewide aggregation. In addition, more detailed land use and land cover data, categorized at level III and IV, will be used more frequently by those who need and generate location information at the intrastate, regional, county, or 16 municipality level (Anderson et al. 1976). Further, land use and land cover classification level V can be added if a finer level of classification is desired. Table 5. Anderson et al. (1976) Land Use Classification Level I Level H 1 Urban or Built-up Land 11 Residential. 12 Commercial and Services. 13 Industrial. 14 Transportation, Communications, and Utilities. 15 Industrial and Commercial Complexes. 16 Mixed Urban or Built-up Land. 17 Other Urban or Built-up Land. 2 Agricultural Land 21 Cropland and Pasture. 22 Orchards, Groves, Vineyards, Nurseries, and Ornamental Horticultural Areas. 23 Confined Feeding Operations. 24 Other Agricultural Land. 3 Rangeland 31 Herbaceous Rangeland. 32 Shrub and Brush Rangeland. 33 Mixed Rangeland. 4 Forest Land 41 Deciduous Forest Land. 42 Evergreen Forest Land. 43 Mixed Forest Land. 5 Water 51 Streams and Canals. 52 Lakes. 53 Reservoirs. 54 Bays and Estuaries. 6 Wetland 61 Forested Wetland. 62 Nonforested Wetland. 7 Barren Land 71 Dry Salt Flats. 72 Beaches. 73 Sandy Areas other than Beaches. 74 Bare Exposed Rock. 75 Strip Mines, Quarries, and Grave Pits. 76 Transitional Areas. 77 Mixed Barren Land. 8 Tundra 81 Shrub and Brush Tundra. 82 Herbaceous Tundra. 83 Bare Ground Tundra. 84 Wet Tundra. 85 Mixed Tundra. 9 Perennial Snow or Ice 91 Perennial Snowfields. 92 Glaciers. Source: Anderson et al. (1976), Public Database Warehouse, USGS (http://www.wsdot.wa.gov/environment/envinfo/docs/RSPrj_USGS__lulcclass.pdf) l7 However “[t]he Level H is the fulcrum of the classification system as Level H can be created by aggregating the similar Level IH categories classification system” (Anderson et al. 1976). The U. S. Geological Survey (USGS) classification system provides flexibility in developing categorization at the more detailed levels and provides freedom to the users to develop categories that meet their particular needs. To retain the compatibility of the information, whatever categories are used at the various classification levels, special attention should be given to providing potential users of the data with sufficient information so that they may either compile the data into more generalized levels or aggregate more detailed data into the existing classes. Basically, the system satisfied the three major attributes process: (1) it gives names to categories by using accepted terminology, (2) it enables information to be transmitted, and (3) it allows inductive generalizations to be made (Anderson et al. 1976). This classification system is capable of further refinement on the basis of more extended and varied use. Regarding the forest land classification, Anderson et al. (1976) stated that “Forest lands have a tree-crown areal density (crown closure percentage) of 10 percent or more, are stocked with trees capable of producing timber or other wood products, and exert an influence on the climate or water regime”. Further, the authors noted when the “trees reach the marketable size”... they may be harvested and replanted and “there will be large areas that have little or no visible forest growth. The pattern can sometimes be identified by the presence of cutting operations in the midst of a large 18 expanse of forest”. Such areas should be included in the forest land category, “[u]nless there is evidence of other use”. And, “[l]ands that meet the requirements for forest land and also for an urban or built-up category should be placed in the latter category. The only exceptions in classifying forest land are those areas which would otherwise be classified as wetland if not for the forest cover. Since the wet condition is of much interest to land managers and planning groups and is so important as an environmental surrogate and control, such lands are classified as forested wetlands” (Anderson et al. 1976). At Level H, forest land is divided into three categories: Deciduous, Evergreen, and Mixed. According to the Anderson et al. (1976) classification, Deciduous Forest Land includes all forested areas having a predominance of trees that lose their leaves at the end of the frost-free season or at the beginning of a dry season. In most parts of the United States, these would be hardwoods such as oak (Quercus), maple (Acer), or hickory (Carya) and “soft” hardwoods, such as aspen (Populus tremuloides). Deciduous forest types characteristic of wetland, such as tupelo (Nyssa) or cottonwood (Populus), also are not included in this category. Evergreen Forest Land included all afforested areas in which the trees are predominantly those which remain green throughout the year. Both coniferous and broadleaf trees are included in this category. The coniferous evergreens are commonly referred to or classified as softwoods. They include eastern species such as balsam fir (Abies balsamea), various spruces (Picea), white pine (Pinus strobus), red pine (Pinus resinosa), jack pine (Pinus banksiana), and hemlock (Tsuga canadensis) (Anderson et al. 1976). Evergreen species commonly 19 associated with wetlands, such as tamarack (Larix laricina) and black spruce (Picea mariana), are not included in this category. Similarly, Anderson et al. (1976) identified Mixed Forest Land as all forested areas where both evergreen and deciduous trees are growing and neither predominates. When more than one-third interrnixture of either evergreen or deciduous species occurs in a specific area, it is classified as mixed forest land. Where the intermixed land use or uses total less than one-third of the specified area, the category appropriate to the dominant type of forest land is applied, whether deciduous or evergreen. Further to this, Forested Wetlands are wetlands dominated by woody vegetation. Forested wetland includes seasonally flooded bottornland hardwoods, mangrove swamps, shrub swamps, and wooded swamps including those around bogs (Anderson et al. 1976). As the forested wetlands can be detected and mapped by the use of seasonal, (winter/summer) imagery, and delineation of Forested Wetlands is needed for many environmental planning activities. For these reasons, they are separated from other categories of forest land. Though several classification approaches are available, the land-use and land-cover classification system devised for the USGS program, developed by Anderson et al. (1976), has become one of the most widely used classification systems for land use maps prepared by interpretation of remotely sensed images (Campbell 2002). 20 Michigan Resource Inventory System The Michigan Resource Inventory System (MIRIS) has two types of inventories about the State’s land and water resources: (1) a current use inventory to illustrate land cover and land use and (2) a land resource inventory which includes resources, unique areas, and areas hazardous to development (Goodwin et al. 2002). The first current use inventory was compiled from photo interpretation of color infrared aerial photography (1224,000 scale to 1 inch to 2,000 ft) obtained in 1978/79. Aerial photography obtained in 1985 was used for the inventory of Detroit and seven highly urbanized countries in Southeast Michigan. The second Michigan Land Cover/ Use Classification System (Division of Land Resource Programs 1981) is similar to the national system developed by the US. Geological Survey (Anderson et al. 1976). It is multi-level, hierarchical system that classified Michigan’s land cover/use into approximately 500 categories (Goodwin et al. 2002). The current use inventory is a subset (approximately 60 level I and H categories) of the Michigan land cover/use classification system (MIRIS, 1981). However, some of the category numbers and category definitions were changed. The MIRIS was upgraded in 2000 by using the 1998/1999 National Aerial Photography Program (NAPP) imagery, flown by the USGS for the entire state of Michigan (RS and GIS, 11. d.). These changes in MIRIS version H were made in order to correct problem areas that existed with the design and application of the earlier version. Changes in H version were based on several criteria. These criteria included: 21 (l) “describing the major components of each category group within the confines of a three-level hierarchy, (2) assigning map codes to categories in the lowest classification levels within each group, (3) creating separate categories for those items that may need to be cross-referenced under various user defined aggregation schemes and (4) maintaining clear distinction between “upland” and “wetland” natural cover types” (MIRIS 2000). In addition, the current use inventory of 22 sampled counties was supplemented with a detailed land component (level IV and V data which included species designation, stand size and stocking classification). National Land Cover Database (NLCD) for the United States In the last decade, a major provider of land cover information within the federal government has been the Multi-Resolution Land Characteristics Consortium GVIRLC) (Homer et al. 2004). The MRLC was originally formed in 1993, to meet the needs of several federal agencies: United States Geological Survey (U SGS), Environmental Protection Agency (EPA), National Oceanic and Atmospheric Administration (NOAA), and US. Forest Service (U SFS) for Landsat 5 imagery and land cover information (Loveland and Shaw 1996). For NLCD 2001 land cover classification, a method that optimally classifies many database layers in a single step, with the ability to document this relationship in a rule base was highly desirable, and the decision tree classification method was chosen (Homer et al. 2004). The authors used the commercial decision tree program C5©. They claimed that this decision tree classification provided an efficient, robust method for classifying large quantities of 22 information in documentable form, and moreover, it allowed them to export mutually exclusive rules generated by the classification into generic textual rule sets allowing users access to classification parameters. NLCD 2001 defined land cover into 29 land cover classes. And forest cover is defined as “areas dominated by trees generally greater than 5 meters tall, and greater than 20 percent of total vegetation (Homer et al. 2004). This database is developed using the Mapping Zone approach, with 66 Zones in the continental United States and 23 Zones in Alaska. IFMAP Classification IFMAP has a hierarchical scheme of classification of land use. The hierarchical scheme contains various levels from very broad categories to more detailed ones. Pacific Meridian Resources (PMR 2001a) stated that the objectives of the classification scheme of IFMAP are: (1) to provide useful land cover labels for forest and wildlife management, (2) to provide suitable strata to support stratified inventory data, and (3) to generate land cover information for use by managers and researchers. In addition, the IFMAP classification kept “the number of classes of land use to a minimum, and ensure that classes agree with the definitions of the other ecoregions” (PMR 2001a). The rule established for this classification system was that each class should have a local management objective. The vegetation classification rules were based on percentage of the ground covered by specific vegetation covers. Vegetation was broken into woody and non-woody with the definition of woody being a plant that contains a secondary xylem (dicotyledons and conifers), then into shrub and tree. The 23 cut off point between forest and non-forest, and vegetated and non-vegetated was set as 25% of ground covered by canopy. The IFMAP classification is very close to the Gap Analysis natural terrestrial cover classification system. In fact, it is a modified UNESCO natural terrestrial cover classification that has six hierarchical levels namely: forests, woodlands, shrublands, dwarf-shrublands, grasslands, and barren (Jennings 1993). The Gap Analysis classification defined forest as areas dorrrinated with a total canopy cover of 61% or more, trees crown usually interlocking; woodlands as areas dominated by trees with a total canopy cover of 26 to 60%, most of the trees not touching each other; shrublands, dwarf—shrublands and grassland as areas with less than 26% total canopy of tree; and barren land as areas with vegetation cover less than 5% (Jennings 1993). At level H, the Gap land use classification combined the land with morphologically similar main vegetation into a class. For the classes of forests, woodlands, shrublands and dwarf- shrublands, the similarities were based on evergreen, deciduous and xeromorphic characteristics (Jennings 1993). The IFMAP level 11 classification has adopted the 60% rule of canopy cover between coniferous and deciduous so that only those stands that were neither dominated by coniferous nor deciduous fell into mixed stands (PMR 2001a). The Level IH classification of IFMAP was developed based on the majority of a particular tree species in a forest stand. And the classification scheme has been viewed as a series of sequential if-then statements. The detail of the IFMAP classification can be accessed in the Michigan Geographical Data Library website (http://www.mcgi.state.mi.us/mgdll) 24 IFMAP uses imagery obtained from the TM 5 and ETM 7+. To understand the IFMAP imagery classification it is important to review the remote sensing techniques of imagery classification, after first describing on-the-ground management and research databases like FIA, 01, and NRIS-FSVeg. Forest Classification of FIA/OI/NRIS-FSVeg FIA and the forest management planning operational databases, 01 and NRIS-FSVeg, provide more detailed forest classification (See Chapter 1). These databases have Level IVN classification. IFMAP has information up to Level HI land use classification (SI 2004). According to the FIA National Core Field Guide Version 2.0 (2004), the forest types of the Continental US. and Alaska have been classified into 28 forest groups, and there are 140 types that best describe the plurality of stocking for all live trees. Similarly, based on online FIA condition table information, the public forest of Michigan has 14 forest groups and 45 distinct forest types (Table 4). FSVeg has classified the national forests of Michigan into 60 forest types, including lowland brush, upland brush and open as a type (Table 3). OI has classified state forests into 16 forest types (Table 2). 01 has defined treed bog, grass, rock, lowland brush, upland brush, marsh, bog, musk, sand dunes, water and other non-stocked or non-forest or non-productive land as non-forest classes. There are differences in the minimum percentage of stocking required to define an area as forest among various databases. The minimum level of forest tree stocking 25 requirement is 10% for FSVeg (Smith et al. 2004), 16.7% for FIA (Hahn and Hansen 1985) and for 01 (Pedersen, per. com..), 25 % for IFMAP (Michigan Geographical Data Library, SI, 2004). Thus there are big differences in inclusion and non-inclusion of areas with lower levels of forest tree stocking. For example, the NRIS-FSVeg has forest type as lowland brush and upland brush and 01 and IFMAP has defined the lowland brush, upland brush, treed bog, as non-forest areas. FIA Sampling For this study, FIA information is serving as the ground truth for assessing the accuracy of forest type classifications of the databases. Therefore, it is important to understand the nature of FIA. With passage of the 1998 Farm Bill, formerly known as the Agricultural Research, Extension, and Education Research Act of 1998 (PL 105- 185), Congress required that the Forest Service conduct annual forest inventories in all states (McRoberts 1999). This Bill made some changes in FIA sampling procedures and intensity. For example, the Farm Bill established requirements that (1) each year, 20 percent of total plots are to be measured in each eastern state and 10 percent of plots are to be measured in each western state, (2) the annual data are to be made available each year, and (3) statewide resource reports are to be published every five years with integration of FIA and the Forest Health Monitoring (FHM) program (McRoberts 1999, Brand 2005). FI-IM is a national program that uses data from 26 ground plots, aerial surveys, and other sources to produce annual estimates of status changes and trends in indicators of health. FIA inventories are commonly designed to meet the specified sampling errors at the State level at the 67 percent confidence limit (Miles et al. 2001). FIA precision standards require sampling intensity of one plot for approximately every 6,000 acres in the North Central region (McRoberts 1999). To satisfy this requirement, the geographical hexagons established for the FHM programs were divided into 27 smaller FIA hexagons, each of which contains approximately 5,900 acres. A grid of field plots was established by selecting or establishing a plot in each smaller hexagon: (1) if an FHM plot fell within a hexagon, it was selected as the grid plot; (2) if no FHM plot fell within a hexagon, the plot from existing network of permanent FIA plots that was nearest the hexagon center was selected as the grid plot; and (3) if neither the FHM nor the existing FIA plot fell within the hexagon a new permanent FIA plot was established at the hexagon center and selected as the grid plot (McRoberts 1999, Brand 2005). The grid of plots is called the federal base sample and is considered as an equal probability sample. In this way, FIA uses grid sampling (Schreuder et al. 2003) that covers a l-acre sample area (Miles et al. 2001). Recent inventories use a common sampling design consisting of four 24.0-foot radius subplots (approximately (1/24th acre) for trees at least 5 inches in diameter and four 68me radius microplots (1/300th acre) for smaller trees (Miles et al. 2001). Another characteristic of the new design is the mapping of differing forest conditions. If two or 27 more conditions occur within a plot, the boundary between them is mapped and the proportion of the plot in each condition is recorded. Forested plots are installed and measured regardless of intended use or any restrictive management policy. After the adoption of the national plot design in the mid 1990’s, all FIA units have implemented a common sampling design consisting of four 24.0- foot radius subplots (approximately 1/24th acre) for trees at least 5 inches in diameter and four 6.8-foot radius microplots (approximately 1/300th acre) for smaller trees (Miles et al. 2001). In this way, tree expansion factors are approximately 6 for trees at least 5 inches in diameter and approximately 75 for smaller trees. Subplot l is the center of the cluster with the other three subplots located 120 ft away at azimuths of 360°, 120°, and 240°, respectively (Miles et al. 2001). In addition, the temporal regularity was incorporated by systematically assigning each hexagon to one of 5 inter-penetrating panels. Plots located in panels 1 to 5 hexagons were to be measured in first to fifth years, respectively. Once the five- or ten-year cycle is complete, the sequence starts again. In fact, FIA inventories are extensive inventories that provide reliable estimates for large sampling areas. As data are subdivided into smaller and smaller areas, such as geographic unit or a county, the sampling errors increase and the reliability of the estimates goes down. There are nine tables in the FIADB (Forest Inventory and Analysis Database) Version 1.0. In this study Condition table and Plot table information is used. 28 Digital Imagery Classification Digital image classification is the process of assigning pixels to classes (Campbell 2002). The aim of the classification is to assign each pixel in an image to a distinct cover class or theme, broadly called an “information class” (Foody 2003). By comparing pixels to one another, and to pixels of known identity, it is possible to assemble groups of similar pixels into classes that are associated with the informational categories of interest to users of remotely sensed data (Campbell 2002). The informational classes are the categories of interest to the users of the data. Informational classes are, for example, different kinds of forests or land uses that convey information to the remotely sensed data users (e. g. policy planners, resource managers, the scientific community and publics). Unfortunately, the information classes are not recorded directly on remote sensing images; they can only be derived indirectly by using evidence contained in the brightnesses recorded by each image. A group of pixels that are uniform with respect to the brightnesses in their several spectral channels is called a spectral class (Campbell 2002). Thus, remote sensing classification proceeds by matching spectral categories to information categories. In general, each pixel is assigned into the class that it most nearly resembles or to which it is closest, according to some measure of distance in a broad sense. In this way, information classes are typically composed of numerous spectral subclasses. 29 Unsupervised classification is defined as the identification of natural groups, or structures, within spectral data (Campbell 2002). In unsupervised classification these natural groups are defined, identified, labeled and mapped. On. the other hand, Campbell (2002) informally defines supervised classification as the process of using samples of known identity (i.e., pixels already assigned to informational classes) to classify pixels of unknown identity (i.e., to assign unclassified pixels to one of several informational classes). Samples of known identity are obtained from those pixels located within training areas, or training fields. Pixels located within these areas from the training samples are used to guide the classification algorithm to assign specific spectral values to appropriate informational classes (Campbell 2002). However, there is no unique criterion to use as the basis for classification; different criteria yield different classifications resulting in a trade-off between accuracy and efforts (Foody 2003). Chuvieco and Congalton (1988) developed a hybrid method which combines the training statistics generated from both supervised and unsupervised classification approaches using two multivariate statistical techniques: cluster and discriminant analysis. This method was used to classify the pixels for IFMAP. In this method, the cluster analysis is used to group together training statistics generated from both classification approaches. The strength of these groupings was then tested using discriminant analysis. “The cluster analysis is not simply a reduction process for the unsupervised approach, but rather a way of combining similar groupings from both the supervised and unsupervised approach and the groupings that contain both supervised 30 and unsupervised statistics provide a powerful match between the informational categories and the spectral classes” (Chuveico and Congalton 1988). The cluster analysis proposed by the authors uses the hierarchical method in which the classes are merged progressively, two in each step, until all classes belong to the same cluster or group. This grouping can result either in a merging of two single classes, or in a class being merged with an already formed grouping, or in merging two groupings. At the end of the process, the user selects the step at which the clustering stops, usually when two classes very remote in the distance matrix are merged. The squared Euclidian distance is selected to calculate the distance between the classes. Once the cluster analysis of the training statistics is completed, discriminant analysis is used to evaluate the strength of the grouping. Discriminant analysis is a multivariate technique that attempts to find a new set of functions which maximize the ratio of the variance between and within groups (Chuvieco and Congalton 1988). The analysis involves a linear transformation of the original variables, which are orthogonal, in such a way that the new functions maximize the separation between the already formed groups. After these functions have been found, it is possible to regroup each one of the original training classes to test its membership in the correct grouping. After running a discriminant analysis on the groups formed by the clustering of the training statistics, each one of the final groups was defined by the average value of its members. Then these average statistics were input into the classification algorithm to be used in the assignment phase of the 31 classification process. In this way, by the clustering and discriminant analysis, “an improvement in the classification results is expected because of the improved grouping of training statistics” (Chuvieco and Congalton 1988). In IFMAP, classification schemes were first developed for the Southern Lower Peninsula in the spring of 2000, and then updated to reflect the differing ecoregions of the Northern Lower Peninsula and the Upper Peninsula in the winter and spring of 2001 (PMR 2001a). Imagery were acquired for different seasons namely, spring (leaf- off imagery), summer (growing season) and senescence imagery (fall images). The images were used to enable separation of species based on their phenological differences, that is, different cropping cycles and moisture regimes that allowed differentiation of these categories (e. g., aspen from northern hardwoods and from oaks). A listing of the classification schemes and the rules developed toderive the classes can be accessed via the Michigan Spatial Data Library website (http://www.dnr.state.mi.us/spatialdatalibrary/sdl2/land_use_cover/2001/ IFMAP_lp_landcover.htm). The acquisition of imagery from Landsat 5 and Landsat 7 sensors for the rest of the state of Michigan took place in Fall 2001. In 2002, imageries from both sensors were collected in leaf-off, mid-season, and senescence for the Northern Lower Peninsula and the Upper Peninsula. A stratified approach was adopted to obtaining the training data. The training data were collected comprising reflectance samples, taken from the images to be classified of the cover types to be classified. These reflectances may 32 differ from region to region, because of phenologic differences, soil variability, hydrologic differences, and elevation changes, though the latter was not a big factor in the State of Michigan (PMR 2001a). To obtain an adequate stratification of training samples, a coverage of ecosystems was intersected with a coverage of the TM scene boundaries, and the resulting polygons were named eco-scenes (PMR 2001a). Eco- scenes of interest were identified and specific areas were chosen for field visits. Since the IFMAP project is specifically concerned with gaining information on forested lands, efforts were made to identify areas of high natural land cover diversity. Training fields are areas of known identity delineated on the digital image, usually specifying comer points of a square or rectangular area using line or column numbers within the coordinate system of the digital image. Specific training areas need to be identified for each informational class following the standard guideline of image classification (Campbell 2002). The key characteristics of training areas are number of pixels, size, location, number, placement, and uniformity (Campbell 2002). According to PMR (2001a), while performing the training of IFMAP imagery, the raw information, such as the canopy closure of the three size classes, various shrub, wetland, and herbaceous species, were recorded from training sites. Field data were digitized onto the imagery using the air photographs and the manual delineations (on the mylar overlays) as guides. The supervised training sites were extracted from this image using a region-growing method. The region growing method ensured a spectrally pure signature, which was appropriate for the supervised 33 classification. It involved specifying a spectral threshold for inclusion in the area of interest (AOI), which eliminated spectrally outlying portions of the sites. After digitizing, the training sites were used to recode a single band. of a TM mosaic, with unique identifiers for each training sample. This layer was used to extract training statistics from any of the imagery mosaics and/or derivatives without re-drawing the training sites. As the atmosphere can substantially impact the signatures derived from imagery, scenes were selected that had a minimum of haze and clouds and no atmospheric correction was made. Images of Southern Michigan were divided into areas with similar date scenes, usually taken on the same date. They were mosaiced, and areas that represented different conditions on the ground were classified separately (PMR 2001a). The primary classifier was a cluster analysis method developed by Chuveico and Congalton (1988), which matched clusters from an unsupervised classification with the training site data collected by the field crews. In the first iteration, the summer image mosaic is classified using the ISODATA unsupervised method (PMR 2001a). The resulting classes were exarrrined with the supervised classes using an agglomerative hierarchical procedure by the application of Mathsoft 2000 (PMR 2001a). This procedure begins by considering each signature, the pixel reflectance value, as a separate group; then it combines and divides groups based on spectral similarity until all signatures are in a single group, displayed in a hierarchical structure according to the order in which the groups were merged or divided. The resulting clustering tree is then examined for matches of supervised and unsupervised 34 signatures, and in the case of a close one-to-one relationship, the unsupervised signature in question can be labeled with the land cover label of the supervised signature. After the tree is fully examined for clusters of this type, the labeled classes are subset from the imagery, and the procedure is run again. This procedure continues until it is no longer advantageous to do so. The cluster analysis method was used initially on the entire image to achieve an Anderson Level I classification, and subsequently on the individual level I cover types to classify the land cover to the desired level. Once the imagery was subset to level I, the signatures derived from the individual subsets will contain less overall spectral diversity, and as a result the subtle differences were more evident and therefore easier to separate. Level H classifications were based on analysis with leaf off imagery and Level HI classification were derived using different masks and use of texture bands. Remote Sensing-Based Cover Type Mapping Studies Ross Nelson and his colleagues (1987) developed a method to assess the continental forest land cover using Landsat Multispectral Scanner (MSS) data. In their study, the authors assessed the Anderson et al. (1976) level H classification for conifer and hardwood forest area in the continental United States. The authors used a stratified random sampling (SRS) approach to allocate Landsat MSS scenes country-wide so that the aerial extent of the conifer/hardwood resources of the United States could be evaluated (Nelson et al. 1987). The authors generalized a priori information in the 35 form of a major forest cover type map (USGS 1980) and used it to stratify the country into six forest strata, namely northern conifer, northern hardwood, central hardwood, southern conifer, western conifer, pinyon-juniper and one non-forest strata. The MSS scenes were allocated to each forest stratum with the constraint that each contain at least two MSS scenes (Nelson et al. 1987). Within each MSS scene, four 200 X 200 pixel sample blocks were located systematically, and each block’s forest cover was identified. The authors’ selection of fewer large blocks was derived from their preliminary study. The sample blocks were first located in the Landsat MSS data and on 1:250,000 USGS topographic map sheets and to facilitate the acquisition of National High Altitude Photography (NHAP). NHAP are color infrared photos flown at a scale of 1:58,000. These photos were used to help identify spectral classes in MSS data and to refine the digital land cover classifications. The authors used most of the photos which were acquired leaves off, facilitating conifer/hardwood identification. A multicluster blocks procedure was used to classify the MSS data. In this method, for a given scene, small blocks of MSS digital data ranging from 40 X 40 to 60 X 60 pixels were clustered into spectral classes using an unsupervised classifier using IDIMS (Interactive Digital Image Manipulation System, ESL 1978). Each spectral class was identified using ancillary data (NHAP, B&W photo quad, false CHI composites) (Nelson et al. 1987). Forest was defined as any spectral class which included forested area >30% canopy closure. Conifer or hardwood forest was defined as a forested area where > 50% of the canopy was coniferous or hardwood, respectively. 36 The Landsat MSS data products and the statistical estimates of the conifer, hardwood, and water resources of the continental United States were evaluated to determine their reliability. The authors used the two different assessments to characterize the accuracy of this MSS classification. The first assessment compared the MSS classification and airphotos on a point by point (pixels by pixel) basis. The second assessment compared aerial estimates of the four cover types derived from the MSS data and from the corresponding areas on the airphotos. A third assessment was carried out to determine the accuracy of the MSS-based national estimates of conifer, hardwood, and water compared with national estimates generated by other US. government agencies. The findings of this study revealed that the national estimates of conifer and hardwood derived using this sampling method is within 3% of the total USDA Forest Service acreage. Comparison of the MSS classification products and airphotos showed that the conifer cover class was correctly identified 74% of the time and hardwood 80% of the time (Nelson et al. 1987). The average classification accuracy countrywide for the four types considered (conifer, hardwood, water and “other”) is 74%, the overall accuracy is 85%. Pax-Lenney et al. (2001) developed a generalized classifier method to monitor temperate conifer forests, ultimately at the global scale with Landsat TM and ETM+ data. Within this context, the generalization refers to a concept in which a classifier is trained with data from one domain, but applied to data from different domains (e. g., different geographic location, time, and /or imaging sensors). The authors mentioned that analytical methods based on image-by-image interpretation are too time- 37 consuming and labor intensive for studies of large areas to be undertaken with any degree of frequency. The authors found generalization is well suited for multitemporal classifications of one Landsat scene using simple dark-object-subtraction (DOS) atmospheric corrections to produce classifications with comparable accuracies as classifications from the more complex radiative transfer corrections, based on over 200 classifications. However, the high degree of variability in the classification accuracies underscores the importance of extensive, in-depth analysis of remote sensing techniques and applications, and highlights the potential problem for misleading results based on just a few tests (Pax-Lenney et al. 2001). Miguel-Ayanz and Bi ging (1997) compared the performance of TM and SPOT data for cover type mapping on the Central Sierra of Spain. The authors used three single- stage and one multistage iterative classification in their study. The three-single stage classifications were: (1) supervised classification with band selection by spectral analysis, (2) supervised classification with band selection with spectral separability indices, and (3) supervised classification with prior probabilities and band selection by spectral separability indices. The multi-stage supervised classification was performed using an iterative classification method. In this method the class that attains the highest average accuracy (Congalton 1991) in each iteration is masked and classified as part of the image. In this way, GIS analysis was used to obtain the prior probabilities for classes being discriminated and these probabilities were used in the band selection and classification processes. However, prior probabilities did not significantly improve the band selection process. In most cases the best band 38 combinations were selected both by the weighted (including the prior probabilities) and nonweighted separability indices. The best overall accuracy, 66%, was obtained for TM data with the iterative classification approach. Accuracy of 61% was the best overall accuracy for SPOT data, which was obtained with the iterative classification methods. For TM imagery, the five most abundant classes, which account for over 72% of the study area, where classified with 90% overall accuracy. Similarly, Wang et al. (2003) compared the dry season ETM+ and 1—m panchromatic sharpened IKON OS imagery classified as tree canopies and open area taking the latter imagery as valid ground truth to assess the tropical deforestation in the in the Amazonian state of Mato Grosso, Brazil. The authors found the squared correlation coefficients (R2) between the canopy cover values derived from ETM+ and H(ONOS were 0.92 and 0.96 at the 30-m and 90-m scales respectively. Thus, the authors argued ETM+ imagery can be used to estimate canopy cover across large areas of tropical forest. Silbemagel et al. (1997) compared the distribution of the landscape measure among landtype association groups in historic (1840’s) and present (1990-1992) landscapes in the Eastern Upper Peninsula of Michigan comprising six counties: Alger, Chippewa, Delta, Luce, Mackinac and Schoolcraft. In addition, the authors compiled quantitative information on landscape metrics, to supplement existing qualitative descriptions of landtype associations (LTA) in the study area. Cover type boundaries between each section line were interpolated using elevation lines, surface geology maps, and other 39 early vegetation maps. The authors used the cover classes based on expanded MIRIS land cover codes. Prevalence or dominance of cover classes was based on class area (CA) and landscape similarity index (LSIM), or class area weighted by total landscape area. “In the four physiographically based land type association groups studied: bedrock-controlled, lowland sand lake plain, morainal origin, and outwash-northem hardwoods and mixed conifer were most prevalent cover types of the 10 studied, historically and currently” (Silbemagel et al 1997). Northern hardwoods were especially prevalent in the moraine groups, while the mixed conifer type was more prevalent in the bedrock group. Wetlands and mixed pines, in addition to northern hardwoods, were also prominent in the lowland group. In the outwash group, these types were also present, but mixed and white pine were more prevalent. Largest single patches (LPI) were found redundant to the LSIM, and therefore were not assessed to the same extent as other indices. The highest LPI values were found in the northern hardwoods in the moraine groups. Skole et al. (2002) developed a model “Forecast Michigan” to forecast the land use or urban sprawl in the context of spatial decision system support. The authors claimed, “the Forecast Michigan models are process models using most of the state’s standard GIS data layers, as well as inputs from economic and demographic models.” They also utilized network analysis algorithms to include transportation routing and traffic demand with enough spatial resolution and sensitivity to provide transportation planners a way to evaluate different corridor alignments and access points in the context of secondary and cumulative impacts on land use change in a project region. 40 There are a large number of studies about detailed forest classification, see for example, Saatchi and Rignot (1996), Mayaux and Lambin (1997), Martin et al.(l998). However, they are based on high spectral resolution remote sensing data like Synthetic Apertures Radar (SAR), NOAA’s Advanced Very High Resolution Radiometer (AVHRR) and Airbrone Visible/Infrared Irnagin g Spectrometer (AVHlIS), respectively. Accuracy Measurement Methods Used for Remote Sensing Imagery Classification Stehman (1999) reviewed several basic probability sampling designs useful for accuracy assessment. According to the author, the first step in choosing'the appropriate sampling design is to “define the population for which the accuracy assessment is needed, and to determine if this population will be partitioned into pixels, polygons, or some other aerial unit. Then the probability sampling design forms the statistical foundation of the assessment.” Basically, “[c]hoosing a design from among a basic probability sampling design options should be guided by the project objectives and the relative importance of other remaining design criteria” (Stehman 1999). In addition, the criteria to consider when planning the sampling design are that “the sample should: (1) satisfy the probability sampling protocol, (2) be simple to implement and analyze, (3) result in low variance for the key estimates of 41 the assessment, (4) permit adequate variance estimation, (5) be spatially well distributed, and (6) be cost effective” (Stehman 1999). Congalton (1991) mentioned “researchers and users of remotely sensed data have a strong knowledge of both the factors needed to be considered as well as techniques used in performing any accuracy assessment”. The accuracy assessment task can be defined as one of comparing two maps, one based upon analysis of remotely sensed data (the map to be evaluated), and another based upon a different source of information (Campbell 2002). Basically, the second map is designated the reference map, assumed to be accurate, that forms the standard for comparison. The reference data are of obvious significance; if they are in error, the attempt to measure accuracy will be in error (Campbell 2002). The simplest method of evaluation is to compare the two maps with respect to the areas assigned to each category and the result of such comparison is to report the areal proportions of categories. These values report the extent of the agreement between the two maps with respect to total areas in each category, but do not take into account compensating errors in misclassification that cause this kind of accuracy measure to be itself inaccurate (Campbell 2002, Congalton and Green 1999). In addition, this form of error assessment is sometimes called non-site specific accuracy, because it does not consider agreement between the maps at specific locations, but only the overall figures for the two maps. The second form of accuracy, site-specific accuracy or classification error, is based upon the detailed assessment of agreement between maps at specific 42 locations (Campbell 2002). This computation is performed by comparing a sample of locations on the map with the same locations on the reference data and keeping track of the number of times there is agreement (Congalton and Green 1999). In the majority of analyses, the units of comparison are simply pixels derived from the remote sensing data, although if necessary, a pair of matching maps can be compared using any network of uniform cells (Campbell 2002). After the maps are evaluated on over all accuracy, the need to evaluate individual categories within the classification scheme is recognized, and so began the use of the error matrix to represent map accuracy (Congaltan and Green 1999). An error matrix is a square array of numbers set out in rows and columns which express the number of sample units (i.e. pixels, clusters of pixels, polygons) assigned to a particular category in one classification relative to the number of sample units assigned to particular category in another classification (Congalton 1991, Congalton and Green 1999). As noted, one of the classifications is considered to be correct (i.e. the reference data) and may be generated from aerial photography, airborne video, ground observation or ground measurement. The columns usually represent this reference data, while the rows indicate the classification generated from the remotely sensed data. In this way, the error matrix is a very effective way to represent map accuracy in that the individual accuracies of each category are plainly described along with both the errors of inclusion (commission errors) and errors of exclusion (omission errors) present in classification (Congalton and Green 1999, Congalton 1991). Sometimes “the error matrix is referred to as a confusion matrix because it 43 identifies not only overall errors for each categories but also misclassifications (due to confusion between categories) by a category” (Campbell 2002). In addition to clearly showing errors of omission and commission, the error matrix can be used to compute other accuracy measures, such as overall accuracy, producer’s accuracy and user’s accuracy (Story and Congalton 1986). Overall accuracy is simply the sum of the major diagonal (i.e., the correctly classified sample units) divided by the total number of sample units in the entire matrix and this is the most commonly reported accuracy assessment statistic and is probably most familiar to the readers (Congalton and Green 1999). Producer’s and user’s accuracy are ways of representing individual category accuracies instead of just the overall classification accuracy. The inspection of the error matrix only reveals the overall nature of errors present; there is often a need for more objective assessment of classification (Campbell 2002). For example, if we are interested to know “are the two maps in agreement?” —this is a question very difficult to answer without the help of just an error matrix. The notion of agreement is difficult to define and implement. The error matrix is an example of a more general class of matrices, known as contingency tables. Some of the procedures that have been developed for analyzing contingency tables can be applied to examination of the error matrix (Campbell 2002). Congalton (1981), Congalton et al. (1983) and Congalton and Green (1999) proposed application of techniques described by Bishop et al. (1975) and Cohen (1960), a discrete multivariate technique, as a measure of improving interpretation of an error matrix. A shortcoming of the usual error matrix is that even chance assignments of pixels to classes can result in surprisingly good results, as measured by percentage correct (Campbell 2002). Hord and Brooner (1976) and others have noted that the use of error matrix accuracy measures is highly dependent upon the samples, and therefore upon the sampling strategy used to derive the observations used in analysis. Kappa is a discrete multivariate technique used in accuracy assessment for statistically determining if one error matrix is significantly different than another G3ishop et al. 1975, Congalton and Green 1999). The result of performing a Kappa analysis is a KHAT statistic Observed — expected A k = 1 — expected There are numerous studies on the assessment of the accuracy of remotely sensed data using the error matrix that calculate overall accuracy, producer’s accuracy, user’s accuracy and the Kappa statistic. Several studies that are relevant to this study were found. Appropriate sample size requirement for the Error Matrix is one of the very important aspect, however “[a] balance between what is statistically sound and what is practically attainable must be found” (Congalton and Green, 1999). Further to this, the authors mention, a general guideline or good “rule of thumb” is to collect a minimum 45 of 50 samples for each vegetation or land cover category in the error matrix. However, for the larger area (i.e. more than a million acres) or the classification with a large number of vegetation or land cover categories (i.e., more than 12 categories), the minimum number of samples needs to be increased to 75 or 100 samples per category (Congalton and Green, 1999). Berlanga-Robels and Ruiz-Luna (2002) studied land use change mapping and change detection in the costal zone of Northwest Mexico using remote sensing. The authors used a multitemporal post-classification study with data from Landsat Multispectral Scanner (MSS) and TM to detect landscape changes. The authors compared four thematic maps (1973, 1986, 1990 and 1997) and classified the land-use into six classes as direct indicators of landscape condition. The accuracy of the classification (only in 1997 scene) was calculated from an error matrix, using overall accuracy assessment and the Kappa coefficients (Berlanga-Robels and Ruiz-Luna 2002). Lawson (n. d.) used the National Resource Inventory (NRI) data, a compilation of natural resource information on non-Federal land in the United States, with Landsat TM scenes in Iowa. The author demonstrated the use of NR1 point data for image classification and assessment of accuracy of a broad cover/use digital layer (TM scence) obtained from the Iowa Department of Natural Resources participating in the GAP program. The author used accuracy assessment tools including an error matrix, overall accuracy, user’s accuracy and producer’s accuracy. 46 Schreuder et al. (2003) made a number of recommendations for accuracy assessment of percent canopy cover, cover type and size class. The authors recommended to “well define vegetation types, stand size and canopy cover percentage to... [e]xplore the use of ambiguous classes, compute the contingency table and Kappa statistics for each of. . .. mapped categories, producers and users accuracy”. Kurvonen and Hallikainen (1999) studied the accuracy assessment of multitemporal ERS-l and JERS-l synthetic aperture radar (SAR) images in Finland test sites. The authors used the confusion matrix for land-cover type and forest type classification accuracy assessment. The authors mentioned that the use of textual parameters significantly improved the classification of land-cover and forest type classification. Liu et al. (2003) compared the neural networks and statistical methods in classification of ecological habitats using FIA data. The authors used two artificial neural networks (ANN) and three traditional statistical classification methods to classify FIA plots into six ecological habitats in the US. Northeast and found four variables (overstory, understory species composition, hardwood basal area percentage, and current FIA forest type) as the most important discriminating variables for habitat classification. In this study also the authors used classification accuracy, Kappa statistics, and a classification success index to compare the classification of ecological habitats. Wicham et al. (2004) assessed the thematic accuracy of the 1992 National Land Cover Data (NLCD) for the six western mapping regions of United States. The authors 47 collected the reference data in each region for a probability sample of pixels stratified by map land-cover class. The authors assessed the thematic accuracy using overall accuracy percentage and an error matrix for Anderson Level H classification. Wessels et al. (2004) compared the classification of the Moderate Resolution Irnagin g Spectroradiometer (MODIS) with the existing Landsat TM land cover maps as reference data for two major conservation areas (Greater Yellowstone Ecosystem- GYE, USA and the Para State, Brazil). In this study, the Landsat TM land cover was processed to their fractional composition at the MODIS resolution (250 m and 500 m). The authors used the error matrix, overall accuracy, producer’s accuracy and user’s accuracy to assess the accuracy of MODIS thematic maps. The findings of this study suggested, in GYE, the MODIS land cover was very successful at mapping extensive cover types (e. g. coniferous forest and grasslands and far less successful at mapping smaller habitats (e. g. wetlands, deciduous tree cover) that typically occur in patches that were smaller than the MODIS pixels. For Para State it was successful at producing a regional forest/non-forest product (Wessels et al. 2004). However, a single 500 m MODIS forest/non-forest product cannot be expected to reflect all the complex human impacts on biodiversity such as secondary regrowth, local land-use matrix dynamics and low intensity logging. Similarly, Powell et al. (2004) studied the sources of error in an accuracy assessment of thematic land-cover maps in the Brazilian Amazon. The authors tried to quantify the subjectivity in reference data labeling and compared reference data produced by 48 five trained interpreters. In addition, the authors identified the impact of other error sources, including geolocational errors between the map and reference data, land- cover changes between the dates of data collection, heterogeneous reference samples, and edge pixels. By the findings of this study the author suggested “ (1) labels of continuous land-cover types are more subjective and variable than the commonly assumed, especially for the transitional classes]; (2) validation data sets that include only non-mixed, non-edges samples are likely to result in overly optimistic accuracy estimates, not representative of the map as a whole” (Powell et al. 2004). Katila et al. (2000) introduced a statistical calibration method aimed at reducing the effects of map errors on multisource forest resource estimates. The authors developed a correction method based on the confusion matrix between land use classes of the field sample plots and the corresponding map information with the empirical example from the ninth National Forest Inventory of Finland. There is a key concern that “the land cover maps derived are often judged to be of insufficient quality for operational applications” (Foody, 2003). For this reason, there is a need to compare the forest management databases available in Michigan. However, comparing the different forest management databases/maps, generated for specific management purposes, is a challenging task because of differences in objectives, spatial resolution and in the definitions of forest type classification. Different government entities like the MDNR, national forests and Forest Service ' However, using multiple interpreters to produce the reference data classification increases reference data accuracy. 49 research are using their own forest type classifications based on their management objectives. In this study, IFMAP is a thematic map and the other databases like FIA, 01 and NRIS-FSVeg are developed based on field observation. In other words, the FIA, 01 and NRIS-FSVeg are the databases developed from on—the-ground observation and IFMAP is generated from space observation. In addition, the spatial resolution of the each database is different. For example the 01 spatial resolution is 900 m2 and FIA is 4 X 1/24th acre. The NRIS-FSVeg and 01 spatial resolution are based on timber stand sizes, the smallest forest management unit. Objectives of the NRIS-FSVeg and 01 database are more or less the same; however, they are not spatially overlapping because of different ownerships. In addition, the IFMAP is a coarse-resolution map, derived from Landsat TM with cluster analysis (Chuvieco and Congalton 1988) using FIA data as the ground truth. A number of authors (see for example, Katila et al. 2000, Wessels et al. 2004, Wicham et al. 2004, Kurvonen and Hallikainen 1999, Lawson, n. d.) have compared the accuracy of thematic maps with the other thematic maps, topographic maps, or ground truth data collected for different objectives. The review of previous studies on the accuracy of thematic maps suggests that, regardless of differences in various databases, the comparison of the accuracy using the standard accuracy assessment statistics like overall accuracy, producer’s accuracy, user’s accuracy and the Kappa coefficient of agreement can be used for this study. The rigorous field data collection of FIA and its exact plot location provide a sound basis for comparative analyses for a large number of forest types representing the majority of the forest area of Michigan. 50 CHAPTER 3 DATA SOURCES, STUDY AREA, CLASSIFICATION APPROACHES AND ANALYSIS METHODS This research was a collaborative project among the Department of Forestry at Michigan State University, the Michigan Department of Natural Resources, national forests of Michigan and USDA-FS Forest Inventory Analysis unit at the North Central Research Station. Details of the data sources, study area, analysis methods are presented in the following sections. Data Sources Forest resource management and planning databases were obtained from the USDA Forest Service and the Michigan Department of Natural Resources. The NRIS-FSVeg information was obtained from the Huron-Manistee National Forest, Hiawatha National Forest and Ottawa National Forest. 01 data for state-owned forests, which included the compartments with YOE 2003 to 2006 available in digitized GIS environment, were obtained from the Forest, Minerals and Fire Management Division of the MDNR. The national forest data were in vector GIS and were re-projected into Michigan Georef State Plane Coordinate System 1983 using the Projection function of ArcToolbox of ArcGIS Desktop (8.2), (Environmental Systems Research Institute, Redlands, California, USA). The OI data were already in Michigan Georef State Plane Coordinate System 1983. 51 Similarly. IFMAP, land cover 2001 which was in image format, was downloaded from the Michigan Spatial Data Library (SDL). The image file of IFMAP was transferred to the grid file using the Irnport-Export function of Imagine 8.7 (Leica Geosystems GIS & Mapping, LLC, St. Gallen, Switzerland ERDAS). IFMAP is in Michigan Georef, so the raster data was not re-projected. IFMAP land cover 2001 satellite imageries were taken during 1997 to 2001. The data fields of the NRIS-FSVeg, national forest database, and 01, state-owned forest were carefully reviewed. Initially, a limited number of the stand parameters were chosen which may be useful in explaining differences/similarities of the forest type among the databases. Four key attributes of the stand were selected for each database. From NRIS- FSVeg the selected stand attributes were size density, stand dbh, stand age and survey year. From OI, the stand attributes of size density, stand age class, total basal area and understory type were selected. All the selected attributes were aggregated (generalized) to help maintain confidentiality of FIADB. By law, the USDA-PS must protect the confidentiality of FIA plot locations. Spatial location of forest for both NRIS-FSVeg and 01 were aggregated to broader categories. Similarly from FIADB, two plot-level information variables were selected, namely: (1) measurement year, and (2) kind code, and three attributes of stands were selected from the condition table, namely: (1) code for forest type of the condition (assigned by field crew), (2) aggregated stand age-the average total age, and (3) growing stock stocking code. The Ecological subsection code, forest types derived from an FIA algorithm and 52 attributes information about present level of stocking similar to stand dbh from NRIS- FSVeg and total basal area from OI were initially selected. These attributes were dropped later from the analysis due to confidentiality concerns. The revision of the attributes useful in explaining differences in classification results was based on the potential problems in maintaining the confidentiality of FIADB. In the end, only the equivalent of forest type, from three databases, was selected. Study Area Out of the 19.28 million acres of forestland in Michigan, 7.14 million acres of forestland is owned publicly (Smith et al. 2004). Public ownership of forestland is distributed among the Federal government (National Forest, Bureau of Land Management and other), State government, counties and municipalities. In Michigan, there are three national forests, namely the Hiawatha, Huron-Manistee and Ottawa, covering about 2.68 million acres (Smith et al. 2004) (Figure 1). Forestland under State ownership is about 3.95 million acres (Smith et al. 2004). This study covers all national forests and the state forest land owned by the MDNR for which 01 information has been digitized. 53 Ottawa N atio nal Forest 33 Natlon al Forests - State-owned Forest, MDN R a, ‘ z; . , ” ‘ ,,.,.____ County boundary .2, . W» M" My 7. . "9‘“ .. 3 _ .1. ,J 50 0 50 100 150 Miles E Figurel. National Forests and Digitized State-Owned Forests in Michigan Classification Approaches When every information source or map or database has its own classification system, making comparisons between and among them is not straightforward. The classification of the reference database or map needs to be regrouped by making it more compatible 54 with other databases. In order to make forest classifications more compatible to each other, all the available forest types in each database were enumerated and a separate list of forest classification of each database was made (See Tables 1 to 4, Chapter I). Then pair-wise crosswalk tables were made by the author, and they were reviewed and revised based of expert judgment by experienced professionals working at Michigan State University (MSU, Drs. Donald Dickmann and Larry Leefers), the MDNR (Dr. Larry Pedersen and Mr. Jason Stephens) , and USDA-FS (Dr. Mark Hansen, NCRS and Mr. Joseph Gates, HMNF). Feedback and comments were incorporated to create the final crosswalk table (Appendix A). Analysis Methods After compiling agency data, the spatial and stand attributes were aggregated at the Forest Social Science and Economics Lab at Michigan State University (MSU). Methods to pick up data field from the grid and vector data were developed in the ESRI ArcView 3.2 environment. There were two types of data in this research. The NRIS-FSVeg and 01 were in vector data shape files, and IFMAP data was in image or raster data. To compile the forest classification of these two types of data set for a point location (e. g an FIA plot) two approaches were proposed. For vector data the Mila Grid extension was downloaded from the ESRI website, which can be used to pick up the grid value from underlying raster data; this picks up the forest type values from the raster layer (IFMAP). Similarly for vector data sets, a script was downloaded from the ESRI website, which can be used to pick up the data field from the vector data. In this way, two methods were proposed to 55 append the stand level data field like forest type, and the other variables selected to explain the difference in the forest types at the sample points. By overlaying the FIA plot vector data with the plot level and stand level information requested for this study as additional data fields to the FIA plot location, all the studied databases’ forest type and the other selected stand level attributes could be added to the FIA plot level record. In this study, Mr. Geoff Holden, North Central Research Station (NCRS), carried out this data compilation. Data extraction at the FIA plot locations was performed using two approaches depending on the data format. For vector data (01 and NRIS-FSVeg), a point in polygon overlay was performed using the "Intersect" tool in ArcGIS 9.0 to pick up the 01 and NRIS-FSVeg forest classification. For raster data (IFMAP), the "Extract Values to Points" tool (from the Spatial Analyst extension in ArcGIS 9.0) was used to pickup the IFMAP classification. In this way, Mr. Holden carried out theoverlay operation and combined the NRIS-FSVeg, 01 and IFMAP information at the exact location of the FIA permanent plots. The resulting database table had a plot number (sequential, l to n), the FIA field crew defined (FLDTYPCD) classification, the 01 or FS-Veg classification, and the IFMAP classification. Hence, each plot/record had data from three sources. In this study, the approach to study remote sensing thematic mapping classifications was adopted to examine the site-specific accuracy of classification of both the satellite imagery and the 01 and NRIS-FSVeg forest classification databases. 56 An error matrix compares the reference condition to the classified condition. Assume that n samples are distributed into k2 cells where each sample is assigned to one of k categories in map or database and independently (usually in the. rows), to one of the same k categories in the reference data set (usually the columns) (Figure2). i = rows (classification) Column total n+j j= columns (reference) row total 2 ni+ H11 H12 H13 nlk n1+ H21 H22 1123 112k n2+ H31 1132 1133 113k n3+ nkl 11kg nk3 nkk nk4- n+1 n+2 n+3 n+k n Note: Adopted from Congalton and Green (1999). Figure 2. Illustration of an Error Matrix or a Confusion Matrix Following Congalton and Green (1999), let ni- denotes the number of samples classified into categories i (i = 1, 2, 3,. k) in the remotely sensed classification or any classification under comparison (for e.g. OI, NRIS-FSVeg and IFMAP) and category j (j =1,2,3, ...... k) in the reference data set (FIA data). k Let ni+ = Znij (column sum) Fl 57 be the number of samples classified into category i in the remotely sensed k classrficatron, and n+ j = Zlnij (row sum) I: be the number of samples classified into category j in the reference data set. The overall accuracy between the defined classification (IFMAP, NRIS-NRIS-FSVeg and OI) and the reference data (FIA) was computed as follows: k 2 “ii Overall accuracy 2 1:1 (1) n Producer’s accuracy can be computed by n if Producer’s accuracy j: —— (2) And user’s accuracy can be computed by “if User’s accuracy ,- = (3) ni+ The user’s accuracy (UA) and producer’s accuracy (PA) can be used to calculate the error of commission and error of omission. Errors of omission (EO) refers to the samples of a certain class of the reference data that were not classified as such and errors of commission refers to the samples of a certain class of the classified data that were wrongly classified ( J anssen and Van der Wel 1994). 58 Campbell (2002) applied the reference classification column sum as the denominator to calculate the Errors of Commissions however, J anssen and Van der Wel (1994) published the following relationships User’s accuracy % = 100% - Errors of Commission (%), and (4) Producer’s accuracy (%) = 100% - Error of Omission (%). (5) A commission error is simply defined as including an area into a category when it does not belong to that category and an omission error is excluding that area from the category in which it truly does belong (Congalton and Green 1999). In this way, “every error is an omission from the correct category and a commission to a wrong category” (Campbell 2002). Cohen (1960) developed a coefficient of agreement (called Kappa statistic) for nominal scales which measure the relationship of two classifications beyond chance agreement to expected disagreement. This measure of agreement uses all cells in the matrix, not just diagonal elements. Again, let p ,1. denote the proportion of samples in the i,jth cell, then corresponding to n ,1. . In other word Pij = ”if, n. Then let pi, and p“. be defined by k Pi+ = Zpij and i=1 59 k p+j = Zprj i=1 The estimate of Kappa (K) is the proportion of agreement after chance agreement is removed from consideration, that is, K = (P0 _ pc) (6) (1'- pc) in which p0 = proportion of units which agree p, = proportions of units for expected chance agreement, and X,- po = Zpii , Pc = 291417”), pij : % + represents summation over the index (Rosenfield and Fitzpatrick-Lins 1986, Congalton and Green 1999) where N = total number of counts in matrix and X i}. = number of counts in ijth cell. For computation purposes k ”Znii - Zni+n+i i=1 [2 .-= (7) "2 " Zni+n+i with "ii , ni+ and n+i as previously defined (Congalton and Green 1999). The KHAT values are a measure of agreement or accuracy (Congalton and Green 1999). Cohen mentions I? = 0 when obtained agreement equals chance agreement. Positive 60 values of Kappa occur from greater than chance agreement; negative vales of Kappa are from less than chance agreement (Rosenfield and Fitzpatrick-Lins 1986). Congalton and Green note the KHAT values can range from +1 to -1. However, since there should be positive correlation between the forest management databases (NRIS-FSVeg, 01 and IFAMP) and the FIA, positive KHAT values are expected. The approximate large sample of Kappa is computed using the Delta methods as follows. )4 620— 612+) 2(1— 62X26262— 62) (1— 62) (24-40;) Var(K n (1" l92)2 (1" 492)3 (1 92)4 (8) 1 where 01 = — 211,-,- 1 k 91=—Z"ii "1:1 1 k k 64: "32 Zn nij(nj++n+i)2 i=1j=1 The variance of KHAT was calculated by using LabView 7.1 (National Instruments) Mr. Murari Regmi, a physics graduate student of MSU, wrote the program as per the author’s instruction. 61 The test statistic for testing the significance of a single error matrix is expressed by 131 A - (9) t/Varl K1 ’ ‘ Z is standardized and normally distributed. Given the null hypothesis H0 : K1 = 0, and the Z: alternative Hl : K 1 at 0, Ho is rejected if Z 2 20,,2 , where 012 is the confidence level if the two-tailed Z test and the degrees of freedom are assumed to be cc (infinity). The KHAT values are a measure of agreement or accuracy. A KHAT value is computed for each error matrix, and it is a measure of how well the classification of a forest management database (NRIS-FSVeg, OI and IFMAP) agrees with the FIA forest classification. This provides a means for testing the significance of the KHAT statistics for a two independent KHAT values, and therefore two error matrices that are significantly different. With this test, it is possible to compare each forest, management database with the FIA forest classification and the classification with higher accuracy was identified. The test statistics for testing if two independent error matrices are significantly different is expressed by |K1 - K2| Z = JVar(I?l)+ Var(K2). (10) Z is standardized and normally distributed (Congalton and Green 1999). Given the null hypothesis H0 :(Kl — K2) = 0, and the alternative Hl :(Kl — K2) at 0, H0 is rejected if Z 62 _>. Z a / 2 , where (1/2 is the confidence level if the two-tailed Z test and the degrees of freedom are assumed to be cc (infinity). Chapter Summary In summary, this chapter outlined the procedures to assess the accuracy of the thematic maps or databases with any reference data sets, which may be ground truth field data or another thematic map or any other source of reliable information. In this study, data were obtained from the MDNR, the Spatial Data Library of Michigan and national forests of Michigan. These data were compared for accuracy using the FIA field crew defined forest type classification (FLDTYPCD) as the reference for comparison. The research team requested the Forest Service North Central Research Station (NCRS) to compile each forest type classifications (FIA, FS Veg and IFMAP in national forest and FLA, 01 and IFMAP in state forest) taking FIA exact location as the point of reference for comparison. The objectives of consistency assessments of the NRIS-FSVeg in the national forest lands and 01 in the state forest lands and IFMAP in the public forest land of Michigan were carried out by computing the error matrix and computation of the overall accuracy, producer’s accuracy and user’s accuracy and Kappa statistic. Each pair of forest classification NRIS-FSVeg, 01 and IFMAP was compared for the agreement in classification by comparing the KHAT statistic. The Z test was carried out to test two independent KHAT values, and therefore two error matrices, for significant differences. By performing this test, the forest classification of higher accuracy was identified both for national forest land and state forest land. 63 CHAPTER 4 RESULTS AND DISCUSSION There are four primary sources of forest management information for public forest land in Michigan, namely IFMAP, 01 (for state forest lands) and NRIS-FSVeg (for national forest lands) and FIA, each with different characteristics (Table 6). IFMAP is a statewide raster map at the resolution of 30 meters by 30 meters and has information on land cover classification. 01 is a forest resource management database developed by the MDNR with the objective of supporting its day-to-day operational activity relating to resource management issues and land use (MDNR n. d.). OI has detailed information in databases covering information ranging from trees resources to special wildlife practices. Similarly, NRIS-FSVeg which is one of several natural resource information systems (NRIS) developed by USDA-FS, has stand-level information for the federally managed national forest land. NRIS-FSVeg has stand examinations, inventories and regeneration survey information to support management of national forests. It has wide range of information ranging from the trees to animal habitats. Besides these, the FIA sample plots have extensive information at the individual plot level. Selected aspects of these four databases are summarized in Chapter 1. The overall comparison of these forest management information databases in terms of defining forest, coverage, objective and other details is summarized in Table 6. 64 8088802 0:": ::0 03:08:): 00:0": 0: SEE": ”0:02 A050: :30: 00:: :0:0:00000:0 :80 80:0 :85 “000808: 03:00:80: 80880:: .wdv >858 .m.D 0: :00 8: .:0:0w0:ww0 80809808 80:: 80809808 :00:0: 8:08:80 0:: 000:00 :0:00:::000_0 :0>0: 00:000 0:02 3:80:30 :80 0:005 0:08 :80 :0:0:0Q 0800 :80 :0=0:0Q :00:0: :0 09¢. 08800 380:0 80808802 380:0 80808802 880:0 80808802 038080 80:: “003:0: 0.0 8088:: 800 89:08: :0 08:0 Z as. 03:00:. :00:0: 0:0:0 :80 000:0 :09»... 380800 00:>:0m 00:0": 080w 8:23: 98:20:: .2; :3 :0w0:08 00 88:82 :0 0:0:m :88: 3:300:05 0:00:8 :0:0:0Z .m.D _0::0:::00 0:::m ::080m0:08 80:0»0000 :00:0: 3:30 8:00:08 8752 0:80 :0 80808808 :00:0: 08:0: 0:00:0: .m.D :0 :0 80802: 088 00:00.00: :88: 0 :0 0:00: 80808808 080:: :00 080:0 0:: :0 :0:::0::0 0:: 80800 0 H :0 :00: 0:: :008 0 2:. 008000: :00:0: :008 0 H 8:08:88 000000 0 H 80:30: :0 03:00.30 :: cm: :000: :0 :0 523 :30:0 :00:0: 00 8:00: 823 8088:: oz 0 5:3 :38: :0 088m 0: 823 8:882 3:00 VNNNd: 0800000 0:00 0:0 3:80:00 0:00 0:0 3:80:00 0:00 0:0 088 :0 00:0 8:882 .368 :0 A30: :0 88m: 3:80:03: 80:03 :80 803: 00: 0%: 3:08:80 :0 80:80: :00:8:0: :0: 8:803: :0>00 :0 088080908 0: =3: :0:: :8: :0>00 8:080 :0: :80 058 82:: :0>00 :8: :00» 000:: :03. :0: 3:088: .:0>00 00:: :000 :0: :88» 0:: :0 00mm :0: :: :0 on 80:80:: :05 :8: 8:00:08 .080 w::>0: 3:088: :0 .080 :000: :0 80080 30:00 :0 050800 8:0: :00 80 :0 000:: :00:0: a: 80 :0 000:: 088 a: 00:0": :0 :0::8:0Q 000:: :0 8:80:08 3:090 80.8 :000: :< 8:080 8 o: :000: :< 3:080 :0 0.8 :000: :< :0>00 80:82:95: :0 J0>meEZ Sn: 0:0:0Q 3 8085:): 8 0000:80Q 8:08:88 ::080w0:0:2 00:0": :0 80.88800 .0 030:. 65 80035 8:33? 0: mg :80 ::080w0:0:>: 0::": :80 80:08:): :00:0n: 0: 2.1.9:": ”807: :0::00::000:0 E :0>Q: 03:00.30 ::080w0:08 :0 0:0: :00:00 38080: 02:00.30 ::080m0:08 :00:0: :000: 8800:0000 0:: w:080 :00:0000 0008000: :0::0: :008 0: 80:0 08:0 :0>0: :0:: :0 808000 808000 :0 :0>0: 0::0:::000< :008 0: 808000 3:0,: :0: 808000 0:30:00008 :008 :80 ::0:0n: :0 :0>0: 3:080:80: :0::08000000 008000: :80 80808808 80:80:80 8800:0000 8.0: 0:83 0:; ::08:0::>:0 08:00:80 :0:00:: :80 SEE»: 875:): 0:: 0:83 mmédmb 0:: 000:0:0: 0::: :0 8:3 mm: 008808.: 8:85 :00: 3:80:00 8:85 :00: 3:80:00 0:00: :0 8:80: 080.: 8:00:83~ :0::00:0 :00:0:0w0> 0:00:80 :0: 00 :80? .0088 :800 : 0:0: :0::0 .0008 .w0x008 .wo: 800:: .80 3:83.30: \ :0::0 80:9: :08: 80:30: 00:03 .0::::0 .0w:::0n:0 :00:00:0 :07: 0:80 :0:00:::000:0 85888 00000088.: 80:08 0:00: .0w0m .080: 3:080:07: 07: 82080: 8:38: :07: :98 0:2 :88 :0 0050.082 S: 0:500 3: 30.800: .0 030:. 66 In defining the forest area, FIA, NRIS-FSVeg and OI are based on the stocking of the forest trees. OI has the requirement of 20 cu ft of timber production per annum per acre. In contrast, IFMAP defines a forest based on the percentage of tree canopy cover. As discussed in Chapter 1, IFMAP and OI have forest types for the lower density cover or stocking of trees. Regarding the objective of each database, 01 and NRIS-FSVeg are very similar to each other. On the other hand, the FIA objectives are very different, and IFMAP’s objectives are very broad. Similarly, regarding the coverage, the IFMAP is a “wall-to-wall” coverage of the entire state of Michigan, and 01 and NRIS-FSVeg covers the state forest land and national forest land, respectively. Although, the FIA has only sample plots, it covers the continental US. From an inventory point of view, 01, NRIS-FSVeg and IFMAP are very close to each other in that they yield a map. In contrast, FIA is extensive in terms of coverage and intensive at the sample unit level. In terms of forest type classification, FIA and NRIS- FSVeg are more detailed and 01 forest classification is more specialized to suit the needs of forest management in Michigan. IFMAP has Level II and HI land cover classification. The users of these databases are different; 01 and NRIS-FSVeg are used within the state and federal forest management agencies. IFMAP and FIA data can be used for a variety of reasons by large numbers of users. The anticipated levels of accuracy of these databases are very close to each other. It will be fair to say that the accuracy of these databases are more related to amount of resources spent to create them. In summary, there are multiple sources of forest management information in Michigan, developed for 67 specific purposes. Prospective users should use either one or multiple sets of databases based on their coverage and objective of use. Only FIA and IFMAP information have complete statewide forest coverage in Michigan. To the extent'that these databases are compatible to each other, they will provide more robust information for forest resource and wildlife habitat management and for sustainable use of ecosystem services. Using FIA Forest Types as the Reference Classification FIA has sample plot information, which are called “conditions” that were derived by “the discrete combination of landscape attributes that define the condition” (Miles et al. 2001) of the particular sampling unit. Forest type is one of the important information items included in the condition attributes. In addition, FIA has tree information that describes each tree over 1 inch in diameter found on subplots. In this study, the FLDTYPCD (a forest type assigned by FIA field crew) information from the condition attributes of FIA plots was used as the ground truth forest classification for assessing the accuracy of IFMAP, NRIS-FSVeg and 01 forest type information. The FIA plots were established using a systematic grid approach (USDA-F8 2004). The sample units for this study were chosen exactly at the FIA plot center location, so the sample units selected for this study could be taken as a sub-population of systematic sampling. The sets of data were filtered by a complex process of FIA data filtering before release. After completing the data filtering process of FIA, NCRS released the classification of 239 FIA sample plots (out of 397 possible) for state forest land and 523 68 sample plots (out of 753 possible) for national forest land (National FIA Database System). The 239 datasets for state forest land was due to the partial digitization (only 40%) of the state forest land database in 01. Accuracy assessments of this study were based on these sample units. As discussed in Chapter 1, there were differences between the FIA, OI/NRIS-FSVeg and IFMAP classification schemes. While developing the crosswalk table between classifications, one-to-one relationships between the reference classifications to the “map” classification were expanded to include cases where categories were “acceptable” or “probably right”. In a previous study, Congalton and Green (1999) used a similar approach in the California hardwood rangeland monitoring project. In this way, one-to- many relationships between the reference classification and “map” classification were identified for some reference classes. The most likely category is referred to as the “primary classification”, and “acceptable” or “probably right” category is referred as the “secondary classification”, of classified map hereafter. In this study, for example, Jack pine in FIA may be reasonably comparable to Jack pine and Oak-Jack pine (secondary) in NRIS-FSVeg. The author had only the FIA field crew defined forest classification, the IFMAP classification and OI or NRIS-FSVeg classification. Due to FIA confidentiality concerns, further details from any data sources would have significantly reduced the number of sample plots available. 69 Congalton and Green (1993) identified eight factors that can affect error matrix or difference matrix results, and these factors provide useful framework for interpretation of the results of this study. The factors were: (1) land-cover change between the time of reference data acquisition and satellite data acquisition, (2) reference sample location error, (3) reference label data entry error, (4) reference label photo interpretation error, (5) inconsistent labeling of reference data due to land-cover heterogeneity surrounding the sample location, (6) difference in map and reference data registration, (7 ) map delineation error, and (8) map classification error (W ickham et al. 2004 cited as Congalton and Green 1993). Wickham and his colleagues (2004) also realized the usefulness of these factors for the interpretation of the results of error matrix. In this study, mainly factors (4) and (5) were important. To deal with these factors, measures were taken while developing the forest classification crosswalks between the reference classification and the classified map or database classification. This measure permitted matches for a number of reference forest types to multiple “map” forest classification types as a potential agreement between the two classifications. In other word, this assumption had made one reference forest classification to many classified forest classifications as “acceptable” classifications. For example, the Black Spruce of FIA was matched with the Lowland Coniferous Forest, Lowland Shrub and Mixed Non—Forested Wetland. These types of relaxation in matching of two classifications may overstate the accuracy of a map or database. In addition, FIA represents “the four 1/24th acre” subplots, IFMAP represents 900 square meters of area and the 01 and NRIS-FSVeg represents an entire stand with a particular 70 forest type. In other words, the relative size of the area represented by these classifications was different. A recent study by Smith et al. (2002) revealed that “accuracy decreases as land cover heterogeneity increases and-as patch size decreases”. Further the author emphasized these landscape variables remain significant factors in explaining classification accuracy. As in this study, both the information on classified map/database and the reference map/database were secondary source information, so the author had to rely on a crosswalk between classifications to make comparisons. In the forest classification crosswalk tables (Appendix A), there were multiple matches of FIA forest classification with a classified classification of IFMAP, 01 and NRIS-FSVeg and OI/NRIS-FSVeg as reference classification with IFMAP as the classified classification. This type of one-to-many relationship between reference and classified map limited direct formulation of the Error matrix and consequently the assessment of the chance agreement and calculation of the KHAT statistic. The accuracy assessments in this study were made from the difference classification matrix. A user’s accuracy of 98.0% (Table 7) for Lowland Conifer Forest means that 50 of 51 of the pixels classified as Lowland Conifer Forest were lowland conifer forest types according to FIA. User’s accuracy is sometimes called “reliability” in the comparison scheme (J anssen and Van der Wel 1994). For example, 18 of 37 Jack pine plots (FIA, 101) are classified as Pines or acceptable Lowland Coniferous Forest (Table 7). Similarly, dividing the number of correctly classified samples by the total column total yields the producer’s accuracy: it indicates the percentage of samples of certain 71 (reference) class that were correctly classified in the comparison scheme (J anssen and Van der Wel 1994). Accuracy Assessment for State Forest Lands (MDNR) To assess the accuracy of the IFMAP classification, FIA and 01 classifications were considered as reference classifications. And to assess the agreement between 01 and FIA forest classification, the FIA forest classification was used as reference classification. In other words, the accuracy of IFMAP and 01 classification was assessed based on FIA classification (Tables 7-9). IFMAP overall accuracy was found to be 63.6% with FIA as the reference classification and 60.3% with 01 as the reference classification. The overall accuracy of 01 was 84.5% compared to FIA as the reference classification. There were differences between the FIA, 01 and IFMAP classification schemes that limited the formulation of the error matrix and simultaneously calculating the chance agreement and calculation of KHAT statistic. The match between the reference classification and classified classification were not along the diagonal as usual. The matched cells were shaded to identify them (Tables 7- 12). Producer’s and user’s accuracy are in the last row and column of the difference classification matrix. The user’s accuracy for the Lowland Conifer was more than 94% in comparison with FIA and OI (T ables 7-8). Thus, IFMAP did well in identifying the Lowland Conifer Forest type. The user’s accuracy for Oak Association was as low a323.5% in comparison to the FIA classification (Table 7) and 29.4% in comparison to 01 72 582 .25 ”8533 2&5 8m .25 525 322208832. swam .8» ”Boga; 2.2% 852 5% 532.58 8mg; .mon $80 32 .5 gases. ea 82% x85 mNfi an Ema—am Nanci and .NA: REE x02. .2: ”movoo <3 $8.58“ {8: .858.“ =Eo>0 0N N m aN o m vm d mN _ 2 3 5332 Nm fl QMN om v n O». _ h an _ 0N m 2 NM .88. ode m m _ ~ N _ 98:03 388-:02 U222 om w.mm N. 2 v m a c ~ 2.5% 9:233 wN 93 On gm .Nm ~ 2 fl _ u 680m anchor—50 cam—33 mN od 0 v N m _ 620m 3022009 6:233 «N 9% N. 3 m ~ ~ _ N v 320% @822 9833 NN ed 0 _ _ muotcou can—ab @822 N v.2. 5N fin N _ _ m 2 E 3:5 a 0.8 _N mm “N N N. _ N ~ _ seafloommxx 5&3. @— mdN v S 2 N v confloom?‘ Mao 3 wNm mm ow m N m wN ~ _ couwmoommxx 3 803633 505.52 06 o cm 0 v ooh Nm £2333 \ seam Baa: od o 2 N a a 2 22:80 asoaeom 2 935 :2“: fig :5 4 8m . 8m 8w .on .. 1 8m . .5. mos. , m2. 92 ENE .N ..__.§ ,w ,. .. $.23. ., .. 88 $96 932 cop—880% n52 BEmmflu 73 5&2 .88 ”@529; 2%.: 8m .SN 525 32328333? swam .§ ”80322 22% 352 .on ”beige 8.3:? New ”58 32 .NN~ gasefi .eN_ 603mm xoflm mN~ "E Ema—mm _N~ 65m com .NS 65a :03. .2: ”380 <5 .3538 €me J5 S0238 €02.85 .o mm N m «m o N mm _ _N 2 2 NN 6.22 NON NNN an a. m 9. _ N am _ 0N N : NN 38 NS ON _N 2 2 o N 5:80 0 953m c222 ode n n m 555 2233 4 od o _ _ €32; Neaam m 32 N N N am Bo; o N.NN mm we R N n _ _ N _ N :83 < N.NN :. 3. m N n E cooauam Eofioz 2 2% N N_ _ N 4 x8 0 N.NN N ”N mN m 380 0 2x: a _ _ c.8355 N 32 N N N 88% 335 m N.NN 2 M: N 2 N 2E Em m EN NN .m _ m NN 2% 8.2: N N. <2 53: 38. 8a 8m 8» SN ONm mom NN. 0N. mNN ,5 N2 :: 538530 8 88 GEE NEE oococflom 838mm 35820 €53 awoken 88m 5.. Sm 98 HO mo x532 855:5 .w 2an 74 28825.2. . .2. "ooEqm 2225 .m 2223.5 22222223012 {2 32220 .0 60032832 502222072 .2 622222 3m .2 6:222 282. .2. 68392222 aEaBm .22 220222200 953m 3x22 20 ”wom c822. .Q 2.222on .0 E2232 .< $350 20 4822222822 M2822 .mu2-w2~22 28222822 M2822 .