A New Tool for Analyzing Gene Activity in Tissues
NoButter helps improve spatial transcriptomics data quality for better tissue analysis.
Béibhinn O’Hora, Roman Laddach, Rosamond Nuamah, Elena Alberts, Isobelle Wall, Joseph Bell, David A Johnston, Sonya James, Jeanette Norman, Mark G. Jones, Ciro Chiappini, Anita Grigoriadis, Jelmar Quist
― 6 min read
Table of Contents
Spatial Transcriptomics is a fancy way of looking at how genes are active in different parts of a tissue. Think of it like reading a book where each page is actually a piece of tissue. Each word on the page is a gene, and now we can see where each gene is active in that tissue. This is important because knowing where these genes are talking can help us understand how Tissues are built and how they work.
The Tech Behind It
There are several new tools for doing this kind of work, and they can be broken down into two main types: sequencing-based and Imaging-based technologies. Sequencing looks at gene activity in a more generalized way, while imaging gives us a closer look, almost like using a camera to zoom in on specific parts of a page. However, with all this new technology, we’re still figuring out how to check if the data we get is good quality.
What’s the Big Deal About Imaging?
When using imaging techniques like the CosMx Spatial Molecular Imager or Xenium, the exact position of each gene's activity is really important. If we don’t know where a gene is located, it becomes hard to say which cell it belongs to, and that can mess up our results. Right now, scientists often use quality checks that were created for regular RNA-sequencing. But they really need special checks just for these new imaging methods.
Transcript Detection and Challenges
In imaging, we capture images at different levels, sort of like taking slices of bread from a loaf. These slices show us how genes are behaving in this tissue world. With the CosMx SMI, these slices are taken very closely together, allowing us to see small changes in gene activity.
However, there’s a catch. Genes can sometimes move a bit too much from where they should be in the tissue. Normally, if the genes are evenly spread, you’d expect to see a balanced number of gene signals in each slice. But that’s not always what happens. For example, in tests on lymph nodes, more gene signals were found in the upper slices compared to the lower ones. Meanwhile, in lung tissue and some breast cancer samples, it was the opposite. This suggests that genes are playing a bit of a hide-and-seek game - not where we expect them to be!
On top of this, when we looked at slices close to the glass slide where the tissue sits, we noticed that more genes were showing up outside of the cells they belong to. This is a problem for data analysis because it can introduce a lot of noise. This noise is like a crowd of chatter that makes it hard to hear the important messages.
Meet NoButter
To tackle the chaos caused by wandering genes, we created something called NoButter. This is a tool for researchers that helps them deal with the messiness of the data. Picture it as a special cleaning tool that helps sort out the signals in the data. NoButter helps visualize how these gene signals are spread out in the slices, and it can get rid of signals that are in the wrong place. It also creates files that are ready for analyzing the data further.
How NoButter Works
So, how does NoButter clean up this data? First, researchers take the raw data from the imaging tools. This data contains all the information about where each gene signal is located. NoButter has several functions that help look closely at the data to check how many gene signals are outside of the cells they should be in. Since finding these misplaced signals can be tricky, especially in areas with a lot of cells, we suggest looking at slices on the edges of the tissue. These slices often provide a clearer picture.
Once NoButter identifies the messy signals, it helps researchers filter them out, creating a new, cleaner set of data. The package also organizes the data files to make them easy to work with for later analysis. It's like putting your messy room in order so you can find your things later.
Real-Life Testing of NoButter
To see how well NoButter works, we tested it with data from various samples, including lymph nodes and lung tissues. After preparing the slides according to certain guidelines, we analyzed the data and found a massive number of gene signals. In one case, we had over 19 million signals in lymph nodes, but a small percentage were in the right spots. A similar pattern was observed in lung samples and breast cancer samples.
As we dug deeper, we found that a big chunk of gene signals was in the slices closest to the glass slide, where the tissue was starting to lose its shape. This makes it harder to tell which gene signals belong where. By applying NoButter, we managed to clean up a significant number of mismatched signals in each sample. We ended up with a new set of high-quality gene signals, ready for further analysis.
In Summary
NoButter provides a handy toolbox for scientists to detect, fix, and improve the quality of data that comes from spatial transcriptomics. By removing misplaced gene signals, we can enhance the overall quality of our findings. This helps researchers understand how tissues function and can lead to better insights into health and disease.
The exciting part is that NoButter is available for everyone! Researchers can easily access it, along with example data and a full tutorial on how to use it. As we continue to develop NoButter, we aim to make it compatible with even more imaging techniques in the future.
So, there you have it! The wild world of spatial transcriptomics made a little simpler, and with a nifty tool to help clean things up along the way. The next time you think about the inner workings of tissues, remember that beneath the surface, there’s a whole lot of gene chatter happening, and now we have a way to make sense of it!
Title: NoButter: An R package for reducing transcript dis-persion in CosMx Spatial Molecular Imaging Data
Abstract: MotivationAdvances in spatial transcriptomics technologies at single-cell resolution have high-lighted the need for innovative quality assessment approaches and improved analytical tools. Imaging-based spatial transcriptomics technologies, such as the CosMx Spatial Molecular Imager (SMI), provide the location and abundance of transcripts through multifocal imaging. Optical sections (or Z-slices) form a Z-stack that represents the tissue depth. Transcript dispersion can be observed across these Z-slice and introduce considerable levels of technical noise to the data that can negatively impact downstream analysis. Package FunctionalityNoButter is an R package designed to evaluate transcript dispersion in CosMx SMI spatial transcriptomics data. Using the raw data, the transcript distribution is assessed for each Z-slice of a Z-stack across multiple fields of views (FOVs). To systematically identify transcript dispersion, the percentage of transcripts located outside cell boundaries is calculated. Z-slices exhibiting high levels of transcript dispersion can be excluded, while high-confidence transcripts are preserved. Usage ScenarioTo demonstrate the functionalities of NoButter, spatial transcriptomics data was generated using the CosMx SMI for lymph node tissue, a lung sample, and two triple-negative breast cancers (TNBCs). Use cases illustrate substantial transcript dispersion in optical planes closer to the glass slide. In these Z-slices, on average, an additional 10% of the transcripts were discarded using NoButter. Cleaning such Z-slices with high dispersion rates reduces technical noise and improves the overall quality of the spatial transcriptomics data. AvailabilityThe package can be accessed at https://github.com/cancerbioinformatics/NoButter.
Authors: Béibhinn O’Hora, Roman Laddach, Rosamond Nuamah, Elena Alberts, Isobelle Wall, Joseph Bell, David A Johnston, Sonya James, Jeanette Norman, Mark G. Jones, Ciro Chiappini, Anita Grigoriadis, Jelmar Quist
Last Update: 2024-11-28 00:00:00
Language: English
Source URL: https://www.biorxiv.org/content/10.1101/2024.11.25.625243
Source PDF: https://www.biorxiv.org/content/10.1101/2024.11.25.625243.full.pdf
Licence: https://creativecommons.org/licenses/by-nc/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to biorxiv for use of its open access interoperability.