Open Access
3 June 2014 Identification of high temperature targets in remote sensing imagery based on factor analysis
Yi-Fan Yu, Jun Pan, Li-Xin Xing, Li-Jun Jiang, Shu Liu, Yue Yuan, Hua-Liang Yu
Author Affiliations +
Abstract
Identification of high temperature targets has great significance to environmental monitoring, disaster warning, resources investigation, and so on. It is also an important basis for the temperature inversion of high temperature targets. Factor analysis starts from the similarity matrix of variables or samples. It sums the multiple variables or multiple samples up to a few factors via performing correlation analysis on them. It can extract information with the least amount of information loss. R-mode factor analysis is conducted to ETM+ remote sensing imagery to get the relationship among band variables. The fire factor, which has indicative significance for the high temperature targets, is confirmed on the basis of factor loading matrix. The mixture tuned matched filtering method is adopted in this article to use factor scores to realize high temperature target recognition. The identification precision reaches 95% in the field confirmation.

1.

Introduction

Factor analysis is a multivariate analysis method which conducts a comprehensive analysis on observed data of multiple variables and multiple samples.1 Based on the study of a similarity matrix of variables or samples, it sums perplexing multiple variables or multiple samples up to a few factors. The combinational relationships of variables or samples are analyzed, so that the essential factors, which play leading roles in the information extraction process, are acquired.

Factor analysis has been used to classify geological samples, to get the main factors of geological structures, to obtain geochemical information of different regions, to explain spatial variables, and to analyze topographic variables from which the hydrological factors can be extracted.26 R-mode factor analysis has been used to do geochemical division and estimate water quality.1,7 After being combined with clustering analysis, it provides a new quantitative method which has explicit geological significance.8

High temperature targets refer to the surface cover types in which temperatures are obviously higher than the temperatures of normal surface cover types (300K±). These abnormally high temperatures, such as forest fires, grassland fires, coal seam spontaneous combustions, volcanic eruptions, etc., are generally higher than 500 K. Due to the higher temperature, their emitted energy in the shortwave infrared (1.33.0μm) can be equal to or even higher than the reflected energy of the surface cover types with normal temperatures. This is a significant feature of high temperature targets and can be derived from the blackbody radiation function.9 High temperature targets have a great significance in environmental monitoring, disaster warning, and resources investigation. In this article, factor analysis is conducted to recognize high temperature targets in remote sensing imagery.

2.

Method

Factor analysis is divided into R-mode factor analysis and Q-mode factor analysis. R-mode factor analysis is mainly used for studying the relationships among variables and realizing the classification of samples, while Q-mode factor analysis is mainly used for studying the relationships among samples and realizing classification of variables. Both of them can be unified by correspondence analysis.10

R-mode factor analysis starts from a similarity matrix/correlation coefficient matrix of variables. Through the correlation analysis of variables, it sums multiple variables up to a few factors. The information can be extracted with the least amount of information loss. Each factor extracted can be considered as a linear combination of the original variables, so that the group characteristics of variables can be analyzed and the thematic significance of every factor will be achieved.

R-mode factor analysis follows the form

Eq. (1)

X=FAT,
where X is an n×m order original data matrix; n is the number of samples/pixels; m is the number of variables/bands; A is an m×p order matrix called the loading matrix that represents the correlation between variables and factors; F is an n×p order matrix called the factor score matrix that represents the correlation between samples and factors; and p is the number of factors.

The calculation process of R-mode factor analysis is as follows:

  • (1) Column standardization processing of original data matrix X (The standardized matrix is also expressed in the term of X).

  • (2) Calculation of correlation coefficient matrix R:

    Eq. (2)

    R=1nXTX.

  • (3) Calculation of eigenvalue Λ and eigenvector T of R.

  • (4) Calculation of factor loading according to the number of factors p:

    Eq. (3)

    A=TΛ12.

  • (5) Calculation of the matrix of factor score F:

    Eq. (4)

    F=XAΛ1.

3.

Data Sources

The study area lies on the intersection of Fugu in Shanxi Province and Baode in Shaanxi Province, China (38°39N39°35N, 110°22E111°19E) (Fig. 1). Bode and Fugu are located on different sides of the Yellow River. The study area is abundant in coal resources. Around the year 2002, high temperature targets, such as coke ovens and metal smelting factories, were widely distributed in this area. One Landsat-7 ETM+ scene acquired the date of July 14, 2002, is selected in this study. A series of imaging processing steps, such as radiometric calibration, atmospheric correction, and clipping, have been performed first. Through the previous processes mentioned above, for remote sensing imagery pixels, their radiant energy should be the sum of the reflected and emitted energies due to the high temperature targets in them. We call the reflectivity of pixels visual reflectivity11 and the physical expression is9

Eq. (5)

ρ0=M1S+M2S+M3(1S)+M4(1S)TθE0cosθ,
where ρ0 is the visual reflectivity of a mixed pixel; M1 is the radiant flux density of emission of the high temperature targets in it; M2 is the radiant flux density of reflectance of the high temperature targets in it; M3 is the radiant flux density of emission of normal temperature surface cover types in it; M4 is the radiant flux density of reflectance of normal temperature surface cover types in it; S is the percent of area of high temperature targets for mixed pixels; θ is sun zenith angle; E0 is the solar irradiance of the upper bound of the atmosphere; and Tθ is the transmittance of the atmosphere.

Fig. 1

Image of study area in ETM+ 7.

JARS_8_1_083622_f001.png

There are nine kinds of surface cover types in the study area. They are residential area, road, forest land, cultivated land (in the hilly area), cultivated land (in the lowland), river, flood plain, gully, and high temperature targets. In the view of the spectral analysis,12 30 representative pixels in each kind of the surface cover type are selected from remote sensing imagery, respectively. There are 270 samples in total. KMO (Kaiser–Meyer–Olkin) and Bartlett spherical degree test13 have been done to all of the selected samples. KMO statistics is an indicator which is used to compare simple correlation and partial correlation coefficients of variables. Its value range is [0, 1]. The greater the KMO value, the more suitable it is for the factor analysis of the original variables to be put into effect. Bartlett is the indicator that is used to test the difference between the actual correlation matrix and the unit correlation matrix. When the value of the significance test of Bartlett is less than a given reliability, the correlation between the original variables is significant. Thus, the original variables are suitable for factor analysis. The results of these tests show that the KMO value is greater than 0.6 and the Bartlett value is less than 0.05, which satisfy the premise of factor analysis. In R-mode factor analysis of the study area, representative samples, eigenvalues, and information quantity are in Table 1 and the factor loading matrix in Table 2 is achieved. As the accumulated information quantity of first three factors has reached 98.141% and satisfies with the requirements of little information loss, this article will focus on the first three factors with larger amounts of information and give detailed explanations of the analysis of them.

Table 1

Correlation matrix eigenvalues and information quantity calculated by the representative samples.

FactorEigenvalueInformation quantity (%)Accumulated information quantity %
Factor 13.98266.36566.365
Factor 21.12718.78585.150
Factor 30.77912.99098.141
Factor 40.0590.98199.121
Factor 50.0480.79399.914
Factor 60.0050.086100.000

Table 2

Factor loading matrix calculated by the representative samples.

Factor 1Factor 2Factor 3
Band 10.9090.3640.107
Band 20.9290.3600.054
Band 30.9110.3890.019
Band 40.5880.3980.698
Band 50.8210.5400.037
Band 70.6650.5140.526

4.

Experimental Results

The R-mode factor loading matrix reflects the relationship between variables and factors. Each factor can be considered as a linear combination of each variable. The thematic significance of R-mode factors is determined based on the understanding of the physical meanings of the variables of each band. The element aij in the factor loading matrix shows the important degree of variable i of the factor j. The greater the absolute value of aij, the more important variable i is to the factor j. That is to say that the thematic meaning of the factor can be described by the variables with larger absolute values of aij. Factor 1 represents the total linear combination of the visual reflectivity of each band and it is called the brightness factor. Its loading is positive in each band. Factor 2 represents the difference between near-infrared and visible light reflectivity and it is called the vegetation factor. The absolute values of factor loading at the fourth and seventh band variables are larger in Factor 3 (i.e., f3). That is to say, f3 mainly represents the information of bands 4 and 7. The inverted result of f3 is equal to the difference value of the combination of bands 7 and 4. The approximate expression of f3 is

Eq. (6)

f30.526ρ70.698ρ4,
where ρ4 is the visual reflectivity of band 4, and ρ7 is the visual reflectivity of band 7.

The main difference between high temperature targets and other surface cover types is that for the targets with high temperature, ρ7 is larger and ρ4 is smaller. Hence, the shared similar principles with the enhanced vegetation index14 (EVI=ρ4ρ3, ρ3 is the visual reflectivity of band 3) are used to enhance the information of vegetation, and f3 can be used to enhance the information of high temperature targets. Calculate the value of f3 for the pixels and the greater the inverted value of f3 is, the more likely it is that there are high temperature targets in them. At the same time, f3 is similar to the normalized difference fire index (NDFI)11 [NDFI=(ρ7ρ4)/(ρ7+ρ4)] in principle, while NDFI is also similar to the normalized difference vegetarian index (NDVI)14 [NDVI=(ρ4ρ3)/(ρ4+ρ3)] in its principle of enhancing vegetation. Like NDFI, f3 can enhance the temperature of high temperature targets, so that f3 can be named a fire factor.

In the light of factor analysis with representative samples, the authors of this article calculate the factor score for the factor loading matrix with the study area pixel data, and get a factor score image (Fig. 2) and representative samples factor illustration (Fig. 3). The R-mode factor score matrix reflects the relationship between samples and factors, thus the DN value in each factor score image represents the weight of the composition of lightness, vegetation, and fire factors. The reflectivity of the flood plain in each band is very high, so that it shows a high brightness value in the score image of Factor 1. Forest and cultivated land have higher compositions of vegetation, so that they both have high brightness values in the score image of Factor 2. Pixels of high temperature objects in the score image of Factor 3 have the highest brightness values. In the factor illustration, all representative pixel samples show different clustering characteristics of points. The pixel samples can be classified and used to recognize targets.

Fig. 2

Factor score image. (a) Factor false color image (123 RGB), (b) Factor 1 image, (c) Factor 2 image, (d) Factor 3 image, (e) the subset of (d).

JARS_8_1_083622_f002.png

Fig. 3

Factors 1, 2, 3 score scatter plot. (a) Factors 1–2 scatter plot, (b) Factors 1–3 scatter plot, (c) Factors 2–3 scatter plot.

JARS_8_1_083622_f003.png

Because of the factor score, a mixture tuned matched filtering,15 (MTMF, the identification method which uses a matched filtering score and infeasibility to measure the similarity degree between unknown pixels and known samples), is conducted to high temperature objects of known samples. After the threshold is set according to the matched filtering score image and infeasibility image scatter plot, 300 pixels of high temperature targets are acquired. Then they are verified one by one in the field. The result of the field verification demonstrates the following facts. 285 pixels of targets are recognized successfully. Most of them are from the coking plants or metal smelting plants, and a few of them are from thermal power plants. All the extracted target pixels have high temperature property meanings. There are 15 error pixels, and among them, 8 pixels are from the flood plains and gullies. The reason for this misjudgment is that they have higher values in the curves of all the bands. The other seven pixels are near high temperature pixels. The reason for this misjudgment is that they have a higher similarity on the spectrum to high temperature targets. The identification precision of MTMF reaches 95%.

In addition, via using a monowindow algorithm,16 the temperature has been inversed from the thermal infrared remote sensing imagery (ETM+ 6) with the same time phase. The pixels in the thermal imagery, which match the positions of the 285 pixels mentioned above, served as thermal infrared abnormal areas, and the other areas are taken as the normal temperature background in the process of statistical analysis of temperature. The results are listed in Tables 3 and 4. The results show that the values of the normal temperature background and high temperature targets are relatively close, which makes it more difficult to distinguish the targets from the background. Moreover, the target with the highest temperature in the results of this inversion is 324.21 K, which is quite different from its actual temperature (500 K+). The temperature of high temperature targets inversed from the shortwave infrared imagery (Table 5) based on blackbody radiation characteristics17 are relatively consistent with the actual temperature. That is to say, thermal infrared remote sensing data fail to reflect the features of high temperature targets properly. The main reason for this case is that the spatial resolution of thermal infrared remote sensing imagery is 60 m × 60 m and of shortwave infrared remote sensing imagery is 30 m × 30 m. The high temperature targets with the equal temperature and area are more easily weakened by the background with a normal temperature in the thermal infrared remote sensing imagery than those in the shortwave infrared remote sensing imagery.

Table 3

Inversion temperature of thermal infrared high targets in the study area.

MinMaxMeanSDModeMedian
T (K)307.41324.21314.083.63315.65313.39

Table 4

Inversion temperature of thermal infrared normal temperature background in the study area.

MinMaxMeanSDModeMedian
T (K)294.06323.21311.027.43311.80310.51

Table 5

Inversion temperature of shortwave infrared high targets in the study area.

MinMaxMeanSDModeMedian
T (K)499.38608.19574.6424.88608.14575.90

5.

Conclusion

R-mode factor analysis starts from the similarity matrix/correlation coefficient matrix of variables. It can sum multiple variables up to a few factors with little information loss. This article uses multispectral remote sensing data in its study. In the results of R-mode factor analysis, a factor loading matrix or factor loading curves reflect the correlation between band variables and factors. Each R-mode factor can be considered as a linear combination of each bands’ variables, so that the group characteristics of variables can be analyzed and the thematic significance of every factor will be achieved.

The score matrix of R-mode factor analysis reflects the correlation between samples and factors. In the results of R-mode factor analysis, a factor score matrix represents the factor composition weight of pixel samples and can be used to classify pixel samples and recognize targets.

Factor loading and factor score have clear thematic significance, which allows the adoption of MTMF to recognize targets. The result of the field verification shows that all the extracted target pixels have high temperature property meanings, and the identification precision reaches 95%.

Acknowledgment

This article was supported by the Higher Specialized Research Fund for the Doctoral Program funding under Grant number 20110061120067.

References

1. 

Y. X. ShiH. J. JiJ. L. Lu, “Factor analysis method and application of stream sediment geochemical partition (in Chinese with English summary),” Geol. Prosp., 40 (5), 73 –76 (2004). http://dx.doi.org/10.3969/j.issn.0495-5331.2004.05.014 Google Scholar

2. 

X. W. MengD. W. DuJ. L. Wu, “Factor analysis for compositional data and its application to the classification of geological samples (in Chinese with English summary),” J. Jilin Univ., 30 (4), 367 –370 (2000). http://dx.doi.org/10.3969/j.issn.1671-5888.2000.04.012 JDXLAW 1671-5489 Google Scholar

3. 

H. ZhangM. Y. HeY. P. Liu, “Geological factor analysis in the search for the ore-forming geochemical information and the application of construction (in Chinese with English summary),” Acta. Univ. Pekinensis, 29 367 –370 (2009). Google Scholar

4. 

H. Q. Chen, “SPSS in the application of multi-objective geochemical partition based on factor analysis (in Chinese with English summary),” China High Tech. Enterprises, 22 71 –73 (2001). http://dx.doi.org/10.3969/j.issn.1009-2374.2011.22.026 Google Scholar

5. 

M. J. UmH. YunC. S. Jeong, “Factor analysis and multiple regression between topography and precipitation on Jeju Island, Korea,” J. Hydrol., 410 (3–4), 189 –203 (2011). http://dx.doi.org/10.1016/j.jhydrol.2011.09.016 JHYDA7 0022-1694 Google Scholar

6. 

I. J. Janneke, “Environmental conditions in the Donggi Cona lake catchment, NE Tibetan Plateau, based on factor analysis of geochemical data,” J. Asian Earth Sci., 44 176 –188 (2012). http://dx.doi.org/10.1016/j.jseaes.2011.04.021 1367-9120 Google Scholar

7. 

A. Ahmad, “R-mod fact or analysis, a popular multivariate statistical technique to evaluate water quality in Khaf-Sangan basin, Mashhad, Northeast of Iran,” Arab. J. Geosci., 6 (3), 893 –900 (2011). http://dx.doi.org/10.1007/s12517-011-0367-7 AJGRAA 1866-7511 Google Scholar

8. 

H. H. LvM. D. RenJ. C. Liu, “Application of Q-model principal factor analysis and clustering method in evaluation on sandstone reservoirs of the neogene in huatugou oilfield, qaidam basin,” Acta. Univ. Pekinensis, 42 (6), 740 –745 (2006). http://dx.doi.org/10.3321/j.issn:0479-8023.2006.06.008 Google Scholar

9. 

Y. F. YuJ. PanL. X. Xing, “Feasibility analysis of shortwave infrared band for recognition of high temperature target,” Remote Sens. for Land and Resources, 26 (1), 25 –30 (2014). http://dx.doi.org/10.6046/gtzyyg.2014.01.05 Google Scholar

10. 

D. M. ZengH. J. JiW. Gao, “The R-Q mode factor analysis and correspondence analysis (In Chinese with English summary),” Comput. Tech. Geophys. Geochem. Exp., 30 (1), 78 –80 (2008). http://dx.doi.org/10.3969/j.issn.1001-1749.2008.01.018 Google Scholar

11. 

Y. J. ZhuL. X. XingJ. Pan, “Method of identifying high-temperature target using shortwave infrared remote sensing data (in Chinese with English summary),” Remote Sens. Info., 26 (6), 33 –36 (2011). http://dx.doi.org/10.3969/j.issn.1000-3177.2011.06.007 Google Scholar

12. 

Y. F. YuJ. PanL. X. Xing, “Identification of high temperature targets in remote sensing imagery based on Mahalanobis distance (in Chinese with English summary),” Remote Sens. Info., 28 (5), 90 –94 (2013). http://dx.doi.org/10.3969/j.issn.1000-3177.2013.05.017 Google Scholar

13. 

D. D. Li, “Research based on factors analysis of the service outsourcing industry competitiveness evaluation—China’s service outsourcing model city as an example (in Chinese with English summary),” Suzhou Univ., (2012). http://dx.doi.org/10.7666/d.y2121725 Google Scholar

14. 

A. HueteK. DidanT. Miura, “Overview of the radiometric and biophysical performance of the MODIS vegetation indices,” Remote Sens. of Environ., 83 (1–2), 195 –213 (2002). http://dx.doi.org/10.1016/S0034-4257(02)00096-2 RSEEA7 0034-4257 Google Scholar

15. 

J. W. Boardman, “Leveraging the high dimensionality of AVIRIS data for improved sub-pixel target unmixing and rejection of false positives: mixture tuned matched filtering,” in Summaries 7th Annu. JPL Airborne Geoscience Workshop, 55 –56 (1998). Google Scholar

16. 

Z. H. Qinet al., “Mono-window algorithm for retrieving land surface temperature from Landsat TM 6 data (in Chinese with English summary),” Acta. Geographica Sinica, 56 (4), 456 –466 (2001). Google Scholar

17. 

J. PanL. X. XingJ. C. Wen, “Inversion method study on short wave infrared remote sensing data high temperature surface feature temperature,” in Proc. IEEE 2nd Int. Congress on Image and Signal Process., 1 –4 (2009). http://dx.doi.org/10.1109/CISP.2009.5301511 Google Scholar

Biographies of the authors are not available.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
Yi-Fan Yu, Jun Pan, Li-Xin Xing, Li-Jun Jiang, Shu Liu, Yue Yuan, and Hua-Liang Yu "Identification of high temperature targets in remote sensing imagery based on factor analysis," Journal of Applied Remote Sensing 8(1), 083622 (3 June 2014). https://doi.org/10.1117/1.JRS.8.083622
Published: 3 June 2014
Lens.org Logo
CITATIONS
Cited by 5 scholarly publications.
Advertisement
Advertisement
KEYWORDS
Factor analysis

Remote sensing

Infrared radiation

Thermography

Infrared imaging

Reflectivity

Statistical analysis

Back to Top