Model observers designed to predict human observers in detection tasks are important tools for assessing task-based image quality and optimizing imaging systems, protocol, and reconstruction algorithms. Linear model observers have been widely studied to predict human detection performance, and recently, deep learning model observers (DLMOs) have been developed to improve the prediction accuracy. Most existing DLMOs utilize convolutional neural network (CNN) architectures, which are capable of learning local features while not good at extracting long-distance relations in images. To further improve the performance of CNN-based DLMOs, we investigate a hybrid CNN-Swin Transformer (CNN-SwinT) network as DLMO for PET lesion detection. The hybrid network combines CNN and SwinT encoders, which can capture both local information and global context. We trained the hybrid network on the responses of 8 human observers including 4 radiologists in a two-alternative forced choice (2AFC) experiment with PET images generated by adding simulated lesions to clinical data. We conducted a 9-fold cross-validation experiment to evaluate the proposed hybrid DLMO, compared to conventional linear model observers such as a channelized Hotelling observer (CHO) and a non-prewhitening matched filter (NPWMF). The hybrid CNN-SwinT DLMO predicted human observer responses more accurately than the linear model observers and DLMO with only the CNN encoder. This work demonstrates that the proposed hybrid CNN-SwinT DLMO has the potential as an improved tool for task-based image quality assessment.
Quantitative imaging biomarkers (QIBs) hold enormous potential to improve the efficiency of clinical trials that use standard-of-care CT imaging. Examples of QIBs include size, shape, intensity histogram characteristics, texture, radiomics, and more. There is, however, a well-recognized gap between discovery and the translation to practice of QIBs, which is driven in part by concerns about their repeatability and reproducibility in the diverse clinical environment. Our goal is to characterize QIB repeatability and reproducibility by using virtual imaging clinical trials (VICTs) to simulate the full data pathway. We start by estimating the probability distribution functions (PDFs) for patient-, disease-, treatment- , and imaging-related sources of variability. These are used to forward-model sinograms that are reconstructed and then analyzed by the QIB under evaluation in a virtual imaging pipeline. By repeatedly sampling from the variability PDFs, estimates of the bias, variance, repeatability and reproducibility of the QIB can be generated by comparison with the known ground truth. These estimates of QIB performance can be used as evidence of the utility of QIBs in clinical trials of new therapies.
There is a tremendous potential for AI-based quantitative imaging biomarkers to make clinical trials with standardof- care CT more efficient. There is, however, a well-recognized gap between discovery and the translation to practice for AI-based imaging biomarkers. Our goal is to enable more efficient and effective imaging clinical trials by characterizing the repeatability and reproducibility AI-based imaging biomarkers. We used virtual imaging clinical trials (VCTs) to simulate the data pathway by estimating the probability distributions functions for patient-, disease-, and imaging-related sources of variability. We evaluated the bias and variance in estimating the volume of liver lesions and the variance of an algorithm, that has shown success in predicting mortality risk for NSCLC patients. We used the volumetric XCAT anthropomorphic simulated phantom with inserted lesions with varied shape, size, and location. For CT acquisition and reconstruction we used the CatSim package and varied acquisition mAs and image reconstruction kernel. For each combination of parameters we generated 20 independent realizations with quantum and electronic noise. The resulting images were analyzed with the two AI-based imaging biomarkers described above, and from that we computed the mean and standard deviation of the results. Mean values and/or bias results were counter-intuitive in some cases, e.g. lower mean bias in scans with lower mAs. Addition of variations in lesion size, shape and location increased variance of the estimated parameters more than the mAs effects. These results indicate the feasibility of using VCTs to estimate the repeatability and reproducibility of AI-based biomarkers used in clinical trials with standard-of-care CT.
Positron emission tomography (PET) imaging has emerged as a standard component for cancer diagnosis treatment and has increasing use in clinical trials of new therapies for cancer and other diseases. The use of PET imaging to assess response to therapy and its ability to measure change in radiotracer uptake is motivated by its potential for quantitative accuracy and high sensitivity. However, the effectiveness depends upon a number of factors, including both the bias and variance in the pre- and post-therapy reconstructed images. Despite all the attention paid to image reconstruction algorithms, little attention has been paid to the impact on task performance of the choice of algorithm or its parameters, even for FBP or OSEM. We have developed a method, called a 'virtual clinical trial', to evaluate the ability of PET imaging to measure response to cancer therapy in a clinical trial setting. Here our goal is to determine the impact of a fully-3D PET reconstruction algorithm and parameters on clinical trial power. Methods: We performed a virtual clinical trial by generating 90 independent and identically distributed PET imaging study realizations for each of 22 original dynamic 18F-FDG breast cancer patient studies pre- and post- therapy. Each noise realization accounted for known sources of uncertainty in the imaging process, specifically biological variability and quantum noise determined by the PET scanner sensitivity and/or imaging time, as well as the trade-offs introduced by the reconstruction algorithm in bias versus variance. Results: For high quantum noise levels, due to lower PET scanner sensitivity or shorter scan times, quantum noise has a measurable effect on signal to noise ratio (SNR) and study power. However, for studies with moderate to low levels of quantum noise, biological variability and other sources of variance determine SNR and study power. In other words, the choice of the fully-3D PET reconstruction algorithm and parameters has minimal impact on task performance. Conclusions: For many clinical trials, the variance aspects of 3D PET and reconstruction method and parameters have minimal to no impact. Variance for other factors, and bias introduced by changes in 3D PET reconstruction between scans can dramatically impact the utility of clinical trials that rely on quantitative accuracy.
This erratum corrects an error in “Measuring temporal stability of positron emission tomography standardized uptake value bias using long-lived sources in a multicenter network,” by D. Byrd et al.
Positron emission tomography (PET) is a quantitative imaging modality, but the computation of standardized uptake values (SUVs) requires several instruments to be correctly calibrated. Variability in the calibration process may lead to unreliable quantitation. Sealed source kits containing traceable amounts of Ge68 / Ga68 were used to measure signal stability for 19 PET scanners at nine hospitals in the National Cancer Institute’s Quantitative Imaging Network. Repeated measurements of the sources were performed on PET scanners and in dose calibrators. The measured scanner and dose calibrator signal biases were used to compute the bias in SUVs at multiple time points for each site over a 14-month period. Estimation of absolute SUV accuracy was confounded by bias from the solid phantoms’ physical properties. On average, the intrascanner coefficient of variation for SUV measurements was 3.5%. Over the entire length of the study, single-scanner SUV values varied over a range of 11%. Dose calibrator bias was not correlated with scanner bias. Calibration factors from the image metadata were nearly as variable as scanner signal, and were correlated with signal for many scanners. SUVs often showed low intrascanner variability between successive measurements but were also prone to shifts in apparent bias, possibly in part due to scanner recalibrations that are part of regular scanner quality control. Biases of key factors in the computation of SUVs were not correlated and their temporal variations did not cancel out of the computation. Long-lived sources and image metadata may provide a check on the recalibration process.
Clinical trials that evaluate cancer treatments may benefit from positron emission tomography (PET) imaging, which for many cancers can discriminate between effective and ineffective treatments. However, the image metrics used to quantify disease and evaluate treatment may be biased by many factors related to clinical protocols and PET system settings, many of which are site- and/or manufacturer-specific. An observational study was conducted using two surveys that were designed to record key sources of bias and variability in PET imaging. These were distributed to hospitals across the United States. The first round of surveys was designed and distributed by the American College of Radiology’s Centers of Quantitative Imaging Excellence program in 2011. The second survey expanded on the first and was completed by the National Cancer Institute’s Quantitative Imaging Network. Sixty-three sites responded to the first survey and 36 to the second. Key imaging parameters varied across participating sites. The range of reported methods for image acquisition and reconstruction suggests that signal biases are not matched between sites. Patient preparation was also inconsistent, potentially contributing additional variability. For multicenter clinical trials, efforts to control biases through standardization of imaging procedures should precede patient measurements.
Due to the wide variability of intra-patient respiratory motion patterns, traditional short-duration cine CT used in respiratory gated PET/CT may be insufficient to match the PET scan data, resulting in suboptimal attenuation correction that eventually compromises the PET quantitative accuracy. Thus, extending the duration of cine CT can be beneficial to address this data mismatch issue. In this work, we propose to use a long-duration cine CT for respiratory gated PET/CT, whose cine acquisition time is ten times longer than a traditional short-duration cine CT. We compare the proposed long-duration cine CT with the traditional short-duration cine CT through numerous phantom simulations with 11 respiratory traces measured during patient PET/CT scans. Experimental results show that, the long-duration cine CT reduces the motion mismatch between PET and CT by 41% and improves the overall reconstruction accuracy by 42% on average, as compared to the traditional short-duration cine CT. The long-duration cine CT also reduces artifacts in PET images caused by misalignment and mismatch between adjacent slices in phase-gated CT images. The improvement in motion matching between PET and CT by extending the cine duration depends on the patient, with potentially greater benefits for patients with irregular breathing patterns or larger diaphragm movements.
Image heterogeneity metrics such as textural features are an active area of research for evaluating clinical outcomes with positron emission tomography (PET) imaging and other modalities. However, the effects of stochastic image acquisition noise on these metrics are poorly understood. We performed a simulation study by generating 50 statistically independent PET images of the NEMA IQ phantom with realistic noise and resolution properties. Heterogeneity metrics based on gray-level intensity histograms, co-occurrence matrices, neighborhood difference matrices, and zone size matrices were evaluated within regions of interest surrounding the lesions. The impact of stochastic variability was evaluated with percent difference from the mean of the 50 realizations, coefficient of variation and estimated sample size for clinical trials. Additionally, sensitivity studies were performed to simulate the effects of patient size and image reconstruction method on the quantitative performance of these metrics. Complex trends in variability were revealed as a function of textural feature, lesion size, patient size, and reconstruction parameters. In conclusion, the sensitivity of PET textural features to normal stochastic image variation and imaging parameters can be large and is feature-dependent. Standards are needed to ensure that prospective studies that incorporate textural features are properly designed to measure true effects that may impact clinical outcomes.
The unique electrical properties of Single-Walled Carbon Nanotubes make them good candidates for thermoacoustic
contrast agents. Theoretical considerations suggest that nanotubes are capable of greatly increasing a material's
absorption of electromagnetic radiation. We describe these properties and discuss our measurements of aqueous
nanotube solutions and nanotube-infused tissue mimicking phantoms. We discuss results and the difficulties currently
associated with making these measurements on nanotubes.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.