NA-MIC Project WeeksThe popularity and use of foundation models (FMs) have exploded in recent years. Within the medical imaging field alone, numerous models have been developed to support various downstream tasks, including classification and segmentation.
However, as a user, it’s hard to understand the embeddings that the models produce. Also, it’s hard to figure out: 1) which model to use, and 2) what tasks the model supports.
In this project, we plan to explore how the cloud can help us understand these embeddings from various FMs. Recently, we have extracted embeddings from lung cancer tumors in the National Lung Screening Trial (NLST) CT dataset from 9 different models. We will use the latest features in the Google Cloud Platform to help us explore and understand these embeddings.
We are most interested in: 1) how these embeddings can be visualized, and if clusters are visible, 2) if these embeddings can be used to find similar patients, and 3) if they can actually be used for other tasks
Possible extensions to this work: 1) We could explore the embeddings that Google has provided from their pathology foundation model here and here. 2) We could extend this exploration to lung ultrasound images, where visualizing the embedding space could help us choose representative and diverse images for expert annotation
How could it relate to Slicer and Imaging Data Commons (IDC)? Given a sample patient image, we could retrieve the k-closest patients in IDC.
Initial analysis (no cloud) of using embeddings for lung cancer histology classification:
Initial analysis (no cloud) of using embeddings for lung cancer staging classification:
No response