Residual W-shape network (ResWnet) for dual-energy cone-beam CT imaging

Xiao Jiang; Hehe Cui; Zihao Liu; Lei Zhu

doi:10.1117/12.2646505

17 October 2022

Proceedings Volume 12304, 7th International Conference on Image Formation in X-Ray Computed Tomography; 1230427 (2022) https://doi.org/10.1117/12.2646505
Event: Seventh International Conference on Image Formation in X-Ray Computed Tomography (ICIFXCT 2022), 2022, Baltimore, United States

Abstract

Deep learning has achieved great success in many medical imaging tasks that lack explicit solutions. In this work, a learning-based method is applied to dual-energy cone-beam CT imaging. We propose a Residual W-shape Network (ResWnet) consisting of three modules: a scatter correction module 𝒮, a material decomposition module ℳ, and a decomposition denoising module 𝒟. Both 𝒮 and 𝒟 use the ResUnet architecture; this lightweight model fuses multi-level features and achieves satisfactory performance with a small number of parameters. 𝒮 acts on dual-energy attenuation projections to reduce scatter contamination, and 𝒟 acts on material composition projections to suppress noise. ℳ links 𝒮 and 𝒟 and performs the domain transform from attenuation projections to material projections. This process can be approximated by polynomials with pre-calibrated parameters; that is, ℳ is a known operator with no trainable parameters in the proposed network. This helps to reduce the number of model parameters and to improve performance with a small training dataset. Using a public head CT dataset, we simulated dual-energy cone-beam CT projections and material projections. The proposed ResWnet was trained, validated and tested on this simulated dataset, verifying its effectiveness in projection-domain scatter correction and low-noise decomposition.

1. INTRODUCTION

Conventional CT measures the spatial distribution of the x-ray linear attenuation coefficient (LAC) [1]. Dual-energy CT (DECT) [2], which scans the object with two different x-ray spectra, extends the measurement to the energy dimension. The quantitative information provided by DECT enables various new applications, including but not limited to electron density and stopping power calculation [3], synthesis of monochromatic images [4], and virtual non-enhanced imaging [5]. After several decades of development, DECT has become a powerful tool in clinical diagnosis [6]. Another widely used CT modality is cone-beam CT (CBCT) [7]. With its high spatial resolution, large volume coverage and open structure, CBCT provides flexible image guidance in image-guided radiotherapy [8] and image-guided intervention [9], and its flexible geometry is also well suited to dedicated clinical tasks such as breast CT, extremity CT and dental CT [10].

Recently, several groups have investigated the feasibility of dual-energy cone-beam CT [11-13], which combines the advantages of DECT and CBCT. Our group implemented a rotating-filter configuration [14] to acquire dual-energy data within a single rotation. We further proposed a joint bilateral filtering-based algorithm to suppress image streaks and amplified decomposition noise [15]. However, photon scatter, a major issue in CBCT imaging [16], was not taken into consideration in our previous research. Scatter contamination can severely degrade imaging accuracy, hampering quantitative dual-energy imaging. Moreover, according to dual-energy imaging theory [17], image-domain decomposition can neither provide accurate material composition nor eliminate beam-hardening effects. Polynomial fitting-based projection-domain decomposition [18] tackles this issue, but it is sensitive to projection noise [19].

This work aims to perform projection-domain scatter correction and material decomposition. To this end, we designed a Residual W-shape Network (ResWnet), which consists of two cascaded ResUnets. To reduce the number of model parameters and achieve satisfactory performance with a small training dataset, a known decomposition operator is used to link the two ResUnets.

2. METHODOLOGY

2.1 Dual-Energy Cone-Beam Projection Model

The polyenergetic forward model for dual-energy cone-beam projections is written as:

where the subscript l/h denotes physical quantities under the low/high incident x-ray spectrum, p represents the transmitted photons, ω represents the spectrum, and μ stands for the LAC. Due to the low scanning dose and large volume coverage, CBCT projections suffer from a high noise level (n_l/h) and severe scatter contamination (s_l/h). Consequently, the actual measurements are modeled as:

The superscripts n and s denote noise and scatter, respectively. After log normalization, the line integrals without and with scatter and noise are:
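For reference, Eqs. (1)-(4) can plausibly be written as below, following the definitions in this subsection. The energy variable E, the ray-path integral over l, and the normalization of ω_l/h to unit area (so that no separate air-scan term appears inside the logarithm) are assumptions of this sketch rather than details confirmed by the text.

```latex
\begin{align}
p_{l/h} &= \int \omega_{l/h}(E)\,
           \exp\!\Big(-\int \mu(x,E)\,\mathrm{d}l\Big)\,\mathrm{d}E \tag{1}\\
p^{n,s}_{l/h} &= p_{l/h} + s_{l/h} + n_{l/h} \tag{2}\\
q_{l/h} &= -\ln p_{l/h} \tag{3}\\
q^{n,s}_{l/h} &= -\ln p^{n,s}_{l/h} \tag{4}
\end{align}
```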

2.2 Projection-Domain Material Decomposition

According to dual-energy CT theory [17], the LAC can be decomposed as:

where ϕ_1/2 are energy-dependent basis functions, which can be interpreted as the mass attenuation coefficients of two basis materials, and ρ_1/2 are the corresponding densities of the basis materials. Substituting Eq. (5) into Eq. (3), we obtain:

where q₁ = ∫ρ₁(x)dl and q₂ = ∫ρ₂(x)dl. Given the incident spectra ω_l/h and the two basis functions ϕ_1/2, material decomposition is an inverse problem that recovers q_1/2 from q_l/h. Since this inversion has no explicit expression, a commonly used analytical decomposition method is polynomial approximation [18]:

where α_i,j and β_i,j are coefficients pre-calibrated with a calibration phantom. Although this method can achieve high accuracy by increasing the polynomial order N, previous research revealed that the decomposition is sensitive to projection noise [19].
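Written out under the definitions of this subsection, Eqs. (5)-(7) plausibly take the following form. The energy variable E and the summation range in Eq. (7) (all monomials of total degree at most N) are assumptions of this sketch; the original calibration may use a different set of terms.

```latex
\begin{align}
\mu(x,E) &= \rho_1(x)\,\phi_1(E) + \rho_2(x)\,\phi_2(E) \tag{5}\\
q_{l/h} &= -\ln \int \omega_{l/h}(E)\,
           \exp\!\big(-q_1\,\phi_1(E) - q_2\,\phi_2(E)\big)\,\mathrm{d}E \tag{6}\\
q_1 &= \sum_{i=0}^{N}\sum_{j=0}^{N-i} \alpha_{i,j}\, q_l^{\,i}\, q_h^{\,j},
\qquad
q_2 = \sum_{i=0}^{N}\sum_{j=0}^{N-i} \beta_{i,j}\, q_l^{\,i}\, q_h^{\,j} \tag{7}
\end{align}
```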

2.3 Residual W-shape Network

Figure 1 presents the ResWnet architecture, which consists of a scatter correction module, a material decomposition module and a decomposition denoising module. For simplicity, "projections" refers to log-normalized projections hereafter.

Figure 1. ResWnet architecture. Gray, blue and yellow boxes represent the scatter correction, material decomposition and decomposition denoising modules, respectively.

Scatter Correction Module

The scatter correction module aims to remove the noise and scatter signals in the projections. In this work, the scatter correction module 𝒮 employs a Residual U-shape network (ResUnet) [20]. The paired projections q^{n,s}_l and q^{n,s}_h are concatenated as a two-channel input, which then passes through six encoder blocks and five decoder blocks successively. To avoid vanishing and exploding gradients [21] in very deep networks, each encoder and decoder block adopts a residual architecture [22] with a shortcut connection from input to output. Considering that q^{n,s}_l/h and q_l/h share the same structure and differ only numerically, a global shortcut connection is applied to add the raw projections directly to the output projections. Unlike the conventional Unet, the last two encoder blocks in 𝒮 keep the same number of features, which reduces the number of model parameters without compromising performance.

Material Decomposition Module (ℳ: q_l/h ↦ q_1/2)

The material decomposition module aims to recover the material composition projections q_1/2 from the dual-energy projections q_l/h. As discussed in Sec. 2.2, this inversion can be approximated by polynomial functions, and the parameters α_i,j and β_i,j can be determined by pre-calibration. Thus, there is no need to train a sub-network for decomposition. In the proposed network, ℳ acts as described by Eq. (7). N was set to 4, and there are no trainable parameters in this module.
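The calibration and application of such a polynomial operator can be sketched in NumPy as follows. This is only an illustration of the idea behind Eq. (7): the Gaussian spectra, the two basis functions, the calibration grid and all function names are hypothetical stand-ins (the text uses Spektr spectra and a calibration phantom), and the index range i + j ≤ N is an assumption.

```python
import numpy as np

ORDER = 4  # polynomial order N used in the text

def poly_terms(q_l, q_h, order=ORDER):
    """Columns q_l**i * q_h**j for all i + j <= order (assumed index range)."""
    cols = [q_l**i * q_h**j
            for i in range(order + 1)
            for j in range(order + 1 - i)]
    return np.stack(cols, axis=-1)

def forward(q1, q2, spectrum, phi1, phi2):
    """Polyenergetic line integral for basis line integrals q1 and q2."""
    exponent = -(q1[..., None] * phi1 + q2[..., None] * phi2)
    return -np.log(np.sum(spectrum * np.exp(exponent), axis=-1))

def calibrate(q_l, q_h, q1, q2, order=ORDER):
    """Least-squares fit of the coefficients alpha (for q1) and beta (for q2)."""
    A = poly_terms(q_l, q_h, order)
    alpha = np.linalg.lstsq(A, q1, rcond=None)[0]
    beta = np.linalg.lstsq(A, q2, rcond=None)[0]
    return alpha, beta

def decompose(q_l, q_h, alpha, beta, order=ORDER):
    """Fixed operator: nothing is trainable once alpha and beta are known."""
    A = poly_terms(q_l, q_h, order)
    return A @ alpha, A @ beta

# Hypothetical toy spectra and basis functions, for illustration only.
E = np.linspace(20.0, 120.0, 51)                  # energy grid in keV
w_low = np.exp(-0.5 * ((E - 50.0) / 12.0) ** 2)
w_high = np.exp(-0.5 * ((E - 80.0) / 15.0) ** 2)
w_low, w_high = w_low / w_low.sum(), w_high / w_high.sum()
phi1 = 0.18 + 0.05 * (60.0 / E) ** 3              # soft-tissue-like basis
phi2 = 0.15 + 0.30 * (60.0 / E) ** 3              # bone-like basis

# Simulated calibration data: a grid of known basis line integrals.
q1, q2 = (g.ravel() for g in np.meshgrid(np.linspace(0, 20, 21),
                                         np.linspace(0, 4, 21)))
q_l = forward(q1, q2, w_low, phi1, phi2)
q_h = forward(q1, q2, w_high, phi1, phi2)

alpha, beta = calibrate(q_l, q_h, q1, q2)
q1_hat, q2_hat = decompose(q_l, q_h, alpha, beta)
rms_order4 = np.sqrt(np.mean((q1_hat - q1) ** 2 + (q2_hat - q2) ** 2))

# A first-order fit on the same data, to show the effect of the order N.
a1, b1 = calibrate(q_l, q_h, q1, q2, order=1)
q1_lin, q2_lin = decompose(q_l, q_h, a1, b1, order=1)
rms_order1 = np.sqrt(np.mean((q1_lin - q1) ** 2 + (q2_lin - q2) ** 2))
print(f"RMS error, order 1: {rms_order1:.4f}; order 4: {rms_order4:.4f}")
```

Because the coefficients are fitted once and then frozen, the operator adds no trainable parameters to the network. Gradients can still pass through it, since it is a polynomial in its inputs.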

Decomposition Denoising Module

Since the module 𝒮 cannot reduce the noise level to zero and the polynomial-based decomposition is sensitive to noise, the output of ℳ would be noisy. The decomposition denoising module 𝒟 is used to further suppress the noise. Like the scatter correction module, 𝒟 employs the ResUnet architecture.
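The data flow through the three modules can be summarized with a short NumPy sketch. It shows only how the modules are chained and how a global shortcut adds the input back to a predicted residual. The stand-in functions, the 2x2 mixing matrix used in place of the polynomial operator, and the use of a global shortcut in the denoising module are all assumptions made for illustration; no actual ResUnet is implemented here.

```python
import numpy as np

def with_global_shortcut(residual_fn):
    """Return a module computing x + residual_fn(x) (global shortcut)."""
    return lambda x: x + residual_fn(x)

def reswnet_forward(q_ns, scatter_net, decompose_op, denoise_net):
    """Chain the three modules on a (2, rows, cols) stack of projections."""
    q_corrected = scatter_net(q_ns)            # attenuation -> attenuation
    q_material = decompose_op(q_corrected)     # attenuation -> material (fixed)
    return q_corrected, denoise_net(q_material)  # material -> material

# Hypothetical stand-ins for the trained networks and the calibrated operator.
scatter_net = with_global_shortcut(lambda x: np.full_like(x, 0.1))
MIX = np.array([[-4.8, 11.6],
                [3.4, -4.7]])                  # placeholder for the polynomial
decompose_op = lambda q: np.tensordot(MIX, q, axes=1)
denoise_net = with_global_shortcut(lambda x: np.zeros_like(x))

q_ns = np.ones((2, 4, 4))                      # low/high channels, toy size
q_corrected, q_material = reswnet_forward(q_ns, scatter_net,
                                          decompose_op, denoise_net)
print(q_corrected.shape, q_material.shape)
```

Both learned modules map a domain onto itself, so each only has to predict a correction to its input; the change of domain is left entirely to the fixed operator in the middle.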

2.4 Data Generation and Network Training

To obtain the training, validation and testing datasets, we first downloaded 22 head CT scans from the public dataset at https://wiki.cancerimagingarchive.net/pages/viewpage.action?pageid=39879146. These CT images were segmented into four tissues (fat, muscle, 200 mg/cc bone and 800 mg/cc bone) using multi-threshold segmentation. We performed polyenergetic forward projection of each volume using two spectra generated by Spektr [23]. The results are the label projections q_l/h for scatter correction. The input projections were then obtained by adding scatter and noise signals generated by Monte Carlo simulation. Muscle and 800 mg/cc bone were selected as the basis materials in this work, and fat and 200 mg/cc bone were decomposed onto these two bases. The label projections for material decomposition, q_1/2, were generated by forward projecting the muscle and 800 mg/cc bone composition images. Each head produced 100 projections, with projection angles equally distributed between 0 and 2π.
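The segmentation and angular sampling steps can be illustrated with a minimal NumPy sketch. The threshold values (in HU), the extra air class, the label ordering and the exclusion of the 2π endpoint are assumptions for illustration; the thresholds actually used are not stated in the text.

```python
import numpy as np

# Hypothetical class labels and HU thresholds (not taken from the paper).
TISSUES = ("air", "fat", "muscle", "bone_200", "bone_800")
THRESHOLDS_HU = (-500.0, -20.0, 150.0, 700.0)

def segment(volume_hu):
    """Multi-threshold segmentation: index into TISSUES for every voxel."""
    return np.digitize(volume_hu, THRESHOLDS_HU)

# 100 projection angles equally spaced over one full rotation.
# Excluding the endpoint avoids duplicating the 0 and 2*pi views (assumption).
N_VIEWS = 100
angles = np.linspace(0.0, 2.0 * np.pi, N_VIEWS, endpoint=False)

# 22 heads with 100 views each give the total number of simulated projections.
N_HEADS = 22
total_projections = N_HEADS * N_VIEWS

labels = segment(np.array([-1000.0, -100.0, 40.0, 300.0, 1200.0]))
print([TISSUES[i] for i in labels], total_projections)
```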

During model training, the network 𝒮 was first optimized by:

The second term was added so that 𝒮 not only reduces the scatter signal, but also produces a projection noise distribution that minimizes the decomposition noise. In this work, w₁ and w₂ were set to 0.9 and 0.1, respectively. Using the trained network 𝒮*, the network 𝒟 was finally trained by:
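Based on the description above, the two training objectives, Eqs. (8) and (9), plausibly have the following structure. The norm is left generic because the text does not state which one was used, and the exact form of both expressions is a reconstruction from the surrounding prose rather than a transcription.

```latex
\begin{align}
\mathcal{S}^{*} &= \arg\min_{\mathcal{S}}\;
  w_1 \big\lVert \mathcal{S}\big(q^{n,s}_{l/h}\big) - q_{l/h} \big\rVert
  + w_2 \big\lVert \mathcal{M}\big(\mathcal{S}\big(q^{n,s}_{l/h}\big)\big)
  - q_{1/2} \big\rVert \tag{8}\\
\mathcal{D}^{*} &= \arg\min_{\mathcal{D}}\;
  \big\lVert \mathcal{D}\big(\mathcal{M}\big(\mathcal{S}^{*}\big(q^{n,s}_{l/h}\big)\big)\big)
  - q_{1/2} \big\rVert \tag{9}
\end{align}
```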

The parameters of both 𝒮 and 𝒟 were optimized with the Adam optimizer, using an initial learning rate of 0.0004 that decays by 8% after each epoch. The batch size was set to 4, and training stopped after 100 epochs.
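As a small worked example, the learning-rate schedule and the weighting of the two loss terms can be written in plain Python. Reading "decays by 8%" as multiplying the rate by 0.92 after every epoch, and counting epochs from zero, are assumptions; the function names are hypothetical.

```python
def learning_rate(epoch, initial=4e-4, decay=0.08):
    """Learning rate at a zero-based epoch, shrinking by `decay` per epoch."""
    return initial * (1.0 - decay) ** epoch

def scatter_loss(projection_error, decomposition_error, w1=0.9, w2=0.1):
    """Weighted sum of the projection-domain and material-domain errors."""
    return w1 * projection_error + w2 * decomposition_error

# Learning rates for the first three epochs of the 100-epoch run.
for epoch in range(3):
    print(epoch, learning_rate(epoch))
```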

3. RESULTS

Corrected projections and decomposed material projections are obtained after the simulated cone-beam projections pass through the first and the second ResUnet, respectively. Tomographic images are reconstructed via the conventional FDK algorithm [24].

Figure 2 displays the dual-energy CT images. As shown in the left column, scatter contamination leads to obvious shading artifacts in soft tissue. The bone tissue, although clearly visualized, has a numerical error of more than 10%. The proposed network successfully removes the image shading while preserving small bones and details in the intracranial soft tissues.

On both soft tissue and bone tissue, the proposed network reduces the error to less than 0.5%.

Figure 3 displays the material composition images. As discussed above, the polynomial fitting-based decomposition in the left column suffers from severe noise, especially in the soft-tissue images. Without significantly compromising spatial resolution, the proposed network suppresses the noise of the bone and muscle composition images by 32.7% and 65.6%, respectively. Subtle details around the nasal cavity are also faithfully recovered by the proposed method.
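A noise-suppression percentage of this kind can be computed from the standard deviation inside a uniform region of interest, as in the sketch below. Treating the reported percentages as the relative reduction in ROI standard deviation is an assumption, and the arrays are synthetic placeholders rather than data from this study.

```python
import numpy as np

def noise_reduction_percent(roi_before, roi_after):
    """Relative reduction of the ROI standard deviation, in percent."""
    return 100.0 * (1.0 - np.std(roi_after) / np.std(roi_before))

# Synthetic ROI: halving the fluctuations about the mean halves the noise.
rng = np.random.default_rng(0)
roi_before = 1.0 + 0.05 * rng.standard_normal((32, 32))
roi_after = roi_before.mean() + 0.5 * (roi_before - roi_before.mean())
reduction = noise_reduction_percent(roi_before, roi_after)
print(f"{reduction:.1f}%")
```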

Figure 2. Dual-energy CT images. Top and bottom rows are low-energy and high-energy CT images, respectively. Mean values of the two boxed areas are listed below each image. Display window: [0.15, 0.25] cm^-1.

Figure 3. Material composition images. Top and bottom rows are bone and muscle images, respectively. Standard deviation (Std) values of the two boxed areas are listed below each image. Display windows: bone [0, 0.8], muscle [0.6, 1.2].

4. CONCLUSION AND DISCUSSION

In this work, we proposed ResWnet for dual-energy cone-beam CT imaging. Three modules were designed for scatter correction, material decomposition and decomposition denoising, respectively. To reduce model complexity and optimize performance with a small training dataset, both trainable modules perform a transform within a single domain, and the domain transform from attenuation projections to material composition projections is achieved by a known operator, i.e., polynomial-based decomposition. A simulation study preliminarily demonstrated the performance of the proposed method in scatter correction and low-noise decomposition.

However, some details are lost in the material images, especially the soft-tissue images, and sharp bone edges are blurred to some extent. A more complex model is needed to improve the performance, but the current study used only 2000 projections for training and validation, which is not enough to train more complex models. More cone-beam projections will be simulated in the next step. Another issue is that the current simulated projections account only for the primary signal, photon scatter and noise; other factors such as off-focal radiation and detector glare are not considered. A more accurate forward projector is needed to simulate realistic projections, so that the trained model can be applied to physical measurements acquired on a real system.

REFERENCES

[1] J. Hsieh, Computed tomography: principles, design, artifacts, and recent advances, SPIE Press (2003).

[2] A. Graser, T. R. Johnson, H. Chandarana, and M. Macari, “Dual energy CT: preliminary observations and potential clinical applications in the abdomen,” European radiology, 19 (1), 13 (2009). https://doi.org/10.1007/s00330-008-1122-7

[3] L. I. R. Garcia, J. F. P. Azorin, and J. F. Almansa, “A new method to measure electron density and effective atomic number using dual-energy CT images,” Physics in Medicine & Biology, 61 (1), 265 (2015). https://doi.org/10.1088/0031-9155/61/1/265

[4] L. Yu, S. Leng, and C. H. McCollough, “Dual-energy CT–based monochromatic imaging,” American journal of Roentgenology, 199 (5), S9–S15 (2012). https://doi.org/10.2214/AJR.12.9121

[5] L.-J. Zhang et al., “Liver virtual non-enhanced CT with dual-source, dual-energy CT: a preliminary study,” European radiology, 20 (9), 2257–2264 (2010). https://doi.org/10.1007/s00330-010-1778-7

[6] M.-J. Kang, C. M. Park, C.-H. Lee, J. M. Goo, and H. J. Lee, “Dual-energy CT: clinical applications in various pulmonary diseases,” Radiographics, 30 (3), 685–698 (2010). https://doi.org/10.1148/rg.303095101

[7] D. Jaffray and J. Siewerdsen, “Cone-beam computed tomography with a flat-panel imager: initial performance characterization,” Medical physics, 27 (6), 1311–1323 (2000). https://doi.org/10.1118/1.599009

[8] X. Liang, Y. Jiang, and T. Niu, “Quantitative cone-beam CT imaging in radiotherapy: Parallel computation and comprehensive evaluation on the TrueBeam system,” IEEE Access, 2169–3536 (2019).

[9] M. Maybody, C. Stevenson, and S. B. Solomon, “Overview of navigation systems in image-guided interventions,” Techniques in vascular and interventional radiology, 16 (3), 136–143 (2013). https://doi.org/10.1053/j.tvir.2013.02.008

[10] R. Fahrig, D. A. Jaffray, I. Sechopoulos, and J. W. Stayman, “Flat-panel conebeam CT in the clinic: history and current state,” Journal of Medical Imaging, 8 (5), 052115 (2021). https://doi.org/10.1117/1.JMI.8.5.052115

[11] R. Cassetta et al., “Fast-switching dual energy cone beam computed tomography using the on-board imager of a commercial linear accelerator,” Physics in Medicine & Biology, 65 (1), 015013 (2020). https://doi.org/10.1088/1361-6560/ab5c35

[12] L. E. Schyns et al., “Optimizing dual energy cone beam CT protocols for preclinical imaging and radiation research,” The British Journal of Radiology, 90 (1069), 20160480 (2017). https://doi.org/10.1259/bjr.20160480

[13] L. Shi et al., “Characterization and potential applications of a dual-layer flat-panel detector,” Medical physics, 47 (8), 3332–3343 (2020). https://doi.org/10.1002/mp.v47.8

[14] C. Fang, G. Xu, and L. Zhu, “Single scan dual energy cone beam CT using a rotating filter,” Medical Imaging 2020: Physics of Medical Imaging, 11312 113123S (2020).

[15] X. Jiang, C. Fang, P. Hu, H. Cui, L. Zhu, and Y. Yang, “Fast and effective single-scan dual-energy cone-beam CT reconstruction and decomposition denoising based on dual-energy vectorization,” Medical Physics, 48 (9), 4843–4856 (2021). https://doi.org/10.1002/mp.v48.9

[16] L. Zhu, Y. Xie, J. Wang, and L. Xing, “Scatter correction for cone-beam CT in radiation therapy,” Medical physics, 36 (6), 2258–2268 (2009). https://doi.org/10.1118/1.3130047

[17] R. E. Alvarez and A. Macovski, “Energy-selective reconstructions in x-ray computerised tomography,” Physics in Medicine & Biology, 21 (5), 733 (1976). https://doi.org/10.1088/0031-9155/21/5/002

[18] P. Stenner, T. Berkus, and M. Kachelriess, “Empirical dual energy calibration (EDEC) for cone-beam computed tomography,” Medical physics, 34 (9), 3630–3641 (2007). https://doi.org/10.1118/1.2769104

[19] M. Petrongolo, X. Dong, and L. Zhu, “A general framework of noise suppression in material decomposition for dual-energy CT,” Medical physics, 42 (8), 4848–4862 (2015). https://doi.org/10.1118/1.4926780

[20] Z. Liu and H. Yuan, “An Res-Unet Method for Pulmonary Artery Segmentation of CT Images,” in Journal of Physics: Conference Series, 012018 (2021).

[21] S. Al-Abri, T. Lin, M. Tao, and F. Zhang, “A Derivative-Free Optimization Method With Application to Functions With Exploding and Vanishing Gradients,” IEEE Control Systems Letters, (99), 1–1 (2020).

[22] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778 (2016).

[23] J. Punnoose, J. Xu, A. Sisniega, W. Zbijewski, and J. Siewerdsen, “spektr 3.0—A computational tool for x-ray spectrum modeling and analysis,” Medical physics, 43 (8), 4711–4717 (2016). https://doi.org/10.1118/1.4955438

[24] L. A. Feldkamp, L. C. Davis, and J. W. Kress, “Practical cone-beam algorithm,” Journal of the Optical Society of America A, 1 (6), 612–619 (1984). https://doi.org/10.1364/JOSAA.1.000612


Xiao Jiang, Hehe Cui, Zihao Liu, and Lei Zhu "Residual W-shape network (ResWnet) for dual-energy cone-beam CT imaging", Proc. SPIE 12304, 7th International Conference on Image Formation in X-Ray Computed Tomography, 1230427 (17 October 2022); https://doi.org/10.1117/12.2646505


KEYWORDS

Bone

Computed tomography

Tissues

Performance modeling

Denoising

Dual energy imaging

X-ray computed tomography
