TY - GEN
T1 - Enhanced RFI Detection in Imbalanced Astronomical Observations Using Weakly Supervised GANs
AU - Sharma, Arush S.
AU - Krunz, Marwan
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Radio astronomical observations are plagued by radio frequency interference (RFI) caused by a variety of man-made signals, e.g., cellular, automotive radar, satellite, GPS, etc. Both the wireless and radioastronomy communities have great interest in identifying RFI in the vicinity of the telescopes in real-time. This paper proposes the use of Generative Adversarial Networks (GANs) in the context of weakly supervised learning as a means to enhance the machine learning (ML) based detection of RFI in captured astronomical scans. Weakly supervised training is particularly appropriate when only a small subset of captured data is labeled, as is the case with many radioastronomical datasets. Such 'class imbalance' in the training dataset hampers the classifier's performance, particularly in terms of identifying the minority class samples with RFI, which is often the class of greater interest. Applying weakly or semi-supervised training to GANs addresses the class imbalance challenges. Our study is based on scans obtained from the 12-meter Alma-like Observatory at Kitt Peak, Arizona. We rely on the experience of radio astronomers to manually label the channels in a small fraction of the captured scans as 'clean' or 'dirty'. The remaining channels of 4 G Hz of the observed spectrum (95% of the scans) are unlabeled. We first use our human-labeled data as ground truth and train a baseline classifier in a supervised manner. Subsequently, we explore two approaches for weakly supervised learning. The first approach uses a combination of an autoencoder and conditional GAN, while the second approach uses a semi-supervised GAN (SGAN). Both techniques harness the features learned from the unlabeled dataset to train the generator and discriminator of a GAN. In the first approach, the trained generator is used to synthesize the dirty data, while in the second approach, the trained discriminator is modified to act as a clean/dirty RFI classifier. Simulations under extremely imbalanced training samples show that the SGAN approach can significantly improve the Fl-score and True Positive Rate (TPR) relative to the baseline classifier.
AB - Radio astronomical observations are plagued by radio frequency interference (RFI) caused by a variety of man-made signals, e.g., cellular, automotive radar, satellite, GPS, etc. Both the wireless and radioastronomy communities have great interest in identifying RFI in the vicinity of the telescopes in real-time. This paper proposes the use of Generative Adversarial Networks (GANs) in the context of weakly supervised learning as a means to enhance the machine learning (ML) based detection of RFI in captured astronomical scans. Weakly supervised training is particularly appropriate when only a small subset of captured data is labeled, as is the case with many radioastronomical datasets. Such 'class imbalance' in the training dataset hampers the classifier's performance, particularly in terms of identifying the minority class samples with RFI, which is often the class of greater interest. Applying weakly or semi-supervised training to GANs addresses the class imbalance challenges. Our study is based on scans obtained from the 12-meter Alma-like Observatory at Kitt Peak, Arizona. We rely on the experience of radio astronomers to manually label the channels in a small fraction of the captured scans as 'clean' or 'dirty'. The remaining channels of 4 G Hz of the observed spectrum (95% of the scans) are unlabeled. We first use our human-labeled data as ground truth and train a baseline classifier in a supervised manner. Subsequently, we explore two approaches for weakly supervised learning. The first approach uses a combination of an autoencoder and conditional GAN, while the second approach uses a semi-supervised GAN (SGAN). Both techniques harness the features learned from the unlabeled dataset to train the generator and discriminator of a GAN. In the first approach, the trained generator is used to synthesize the dirty data, while in the second approach, the trained discriminator is modified to act as a clean/dirty RFI classifier. Simulations under extremely imbalanced training samples show that the SGAN approach can significantly improve the Fl-score and True Positive Rate (TPR) relative to the baseline classifier.
KW - Autoencoder
KW - CNN
KW - GANs
KW - Machine Learning
KW - RF Interference
KW - radioastronomy
UR - http://www.scopus.com/inward/record.url?scp=85202445636&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85202445636&partnerID=8YFLogxK
U2 - 10.1109/ICCWorkshops59551.2024.10615823
DO - 10.1109/ICCWorkshops59551.2024.10615823
M3 - Conference contribution
AN - SCOPUS:85202445636
T3 - 2024 IEEE International Conference on Communications Workshops, ICC Workshops 2024
SP - 323
EP - 328
BT - 2024 IEEE International Conference on Communications Workshops, ICC Workshops 2024
A2 - Valenti, Matthew
A2 - Reed, David
A2 - Torres, Melissa
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 Annual IEEE International Conference on Communications Workshops, ICC Workshops 2024
Y2 - 9 June 2024 through 13 June 2024
ER -