🧼SC2Spa: Single Cell to Spatial Mapping🚀, 💥Blastn2dotplots: Multiple Dot-plot Visualizer, 🔪KILDA: Identifying KIV-2 Repeats From kmers🧬

🧼SC2Spa: Single Cell to Spatial Mapping🚀, 💥Blastn2dotplots: Multiple Dot-plot Visualizer, 🔪KILDA: Identifying KIV-2 Repeats From kmers🧬

Bioinformer Weekly Roundup

Stay Updated with the Latest in Bioinformatics!

Issue: 89 | Date: 6 June 2025

👋 Welcome to the Bioinformer Weekly Roundup!

In this newsletter, we curate and bring you the most captivating stories, developments, and breakthroughs from the world of bioinformatics. Whether you are a seasoned researcher, a student, or simply curious about the intersection of biology and data science, we have got you covered. Subscribe now to stay ahead in the exciting realm of Bioinformatics!

🔬 Featured Research

De Novo sequencing-assisted homology search for DIA data analysis enables low abundance peptide variants discovery | bioRxiv

This study explores the use of De Novo sequencing combined with homology searches to analyze Data-Independent Acquisition (DIA) data. The approach claims to enhance the discovery of low abundance peptide variants, providing deeper insights into proteomic data.

Single-cell morphological profiling reveals insights into programmed cell death | bioRxiv

The research utilizes single-cell morphological profiling to investigate programmed cell death. By analyzing individual cell morphologies, the study uncovers detailed mechanisms and pathways involved in cell death processes.

The divergent intron-containing actin in sponge morphogenetic processes | NAR Genomics and Bioinformatics

This paper examines the role of divergent intron-containing actin genes in sponge morphogenesis. The findings highlight the unique genetic and developmental processes in sponges, contributing to our understanding of early metazoan evolution.

Integrating gene expression, genomic, and phosphoproteomic data to infer transcription factor activity in lung cancer | Oxford Academic

This study integrates transcriptomic, genomic, and phosphoproteomic data to infer transcription factor (TF) activity in lung adenocarcinoma. From 1667 TFs, 34 showed altered activity based on target gene expression. Phosphoproteomic data identified phosphorylation events modulating TF function. ERG emerged as a key TF with activity correlating to patient survival, mentioning the utility of multi-omics integration for TF activity inference.

Integrated transcriptomic and metabolomic analyses reveal tissue-specific accumulation and expression patterns of monoterpene glycosides, gallaglycosides, and flavonoids in Paeonia Lactiflora Pall. | BMC Genomics

Researchers analyzed four tissues of Paeonia lactiflora using transcriptomics and metabolomics to map the biosynthesis of monoterpene glycosides (MGs), gallaglycosides (GGs), and flavonoids. Tissue-specific accumulation patterns were identified, with roots showing the highest metabolite levels. Key biosynthetic genes, including CYP71AN24.1 and UGT91A1.1, were identified through correlation and network analyses.

Comparative study on the mechanism of yellow petal formation in Paphiopedilum armeniacum: an integrated transcriptomic and metabolomic analysis of three Paphiopedilum species | BMC Genomics

This study compared three Paphiopedilum species to uncover mechanisms behind yellow petal formation in P. armeniacum. Metabolomic profiling revealed increased carotenoids like zeaxanthin. Transcriptomic analysis identified 40 differentially expressed genes, including PSY and CYP97B3, linked to carotenoid biosynthesis. Integrated analyses suggest coordinated regulation of pigment-related genes and metabolites.

Flexible, production-scale, human whole genome sequencing on a benchtop sequencer | BMC Genomics

The study demonstrates high-throughput human whole genome sequencing (hWGS) using a benchtop sequencer. A total of 807 samples were sequenced with high quality metrics (median Q30: 96.6%). The platform supported rapid sequencing (<12 hours) and flexible library configurations, enabling cost-effective, scalable hWGS in smaller labs.

Transcriptomic analysis reveals genetic factors underlying impaired symbiotic nitrogen fixation in lines derived from crosses between cultivated peanut (Arachis hypogaea L.) and its wild ancestors | BMC Genomics

Transcriptomic comparisons between cultivated peanut and wild-derived lines with impaired nitrogen fixation identified altered expression in symbiosis-related genes. The gene FEN1, essential for nitrogenase activity, was notably under expressed in the hybrids. These findings suggest FEN1 as a candidate gene affecting symbiotic efficiency in peanut.

Placental whole transcriptome expression profile in patients with early-onset, late-onset preeclampsia and gestational diabetes mellitus | Scientific Reports

This study used whole-transcriptome sequencing of placental samples to investigate molecular differences in early-onset and late-onset preeclampsia (PE) and gestational diabetes mellitus (GDM). It identified distinct pathway dysregulations, including Wnt and PI3K-Akt in early-onset PE, and immune-related pathways in late-onset PE and GDM. Immune cell infiltration patterns varied across conditions, and specific competing endogenous RNA networks were associated with each disorder.

The high-quality telomere-to-telomere genome assembly of the earthworm (Amynthas aspergillum) | Scientific Data

Researchers produced a high-quality telomere-to-telomere genome assembly of the earthworm Amynthas aspergillum, covering 98.43% of the genome and identifying over 35,000 protein-coding genes. The assembly provides a genomic foundation for studying evolutionary mechanisms and potential pharmacological properties of this species. Chromosomal anchoring and telomere detection were also detailed.

Single-molecule direct RNA sequencing reveals the shaping of epitranscriptome across multiple species | Nature Communications

This work introduced a deep learning model, SingleMod, for detecting N6-methyladenosine (m6A) modifications at single-molecule resolution using direct RNA sequencing. The model enabled transcriptome-wide mapping of m6A in human cells and comparative analysis across eight species. Findings highlighted heterogeneity in m6A distribution and suggested phylogenetically linked regulatory patterns.

Comparative macrogenomics reveal plateau adaptation of gut microbiome in cervids | BMC Biology

The study generated a comprehensive gut microbiome catalog from cervids, identifying over 41,000 metagenome-assembled genomes. Comparative analyses revealed coevolution between hosts and microbiota, with plateau-dwelling cervids showing genomic adaptations in energy metabolism pathways. Structural variations in microbial genomes were linked to environmental adaptation mechanisms.

🛠️ Latest Tools

SC2Spa: a deep learning based approach to map transcriptome to spatial origins at cellular resolution | BMC Bioinformatics

SC2Spa introduces a deep learning framework that infers the spatial coordinates of single cells from transcriptomic data. The method bridges the gap between single-cell RNA sequencing and spatial transcriptomics by accurately mapping cells onto tissue architectures. It demonstrates robust performance across various spatial resolutions and platforms while also highlighting spatially variable gene expression.

The source code is available here.

Blastn2dotplots: multiple dot-plot visualizer for genome comparisons | BMC Bioinformatics

Blastn2dotplots is a Python-based tool that transforms BLASTN output into customizable dot-plots for genome comparisons. By treating each alignment as a separate subplot, it allows independent axis control and flexible visualization of structural variations, repeat regions, and sequence similarities. This approach facilitates detailed comparative genomic analyses without concatenating multiple sequences into one continuous axis.

The source code is available here.

MultiVis.js: a software tool for the visualization of multiway chromatin interactions and SPRITE data | BMC Bioinformatics

MultiVis.js  provides an interactive visualization platform for multiway chromatin interactions derived from SPRITE data. The tool generates three-dimensional representations of genomic contacts, enabling researchers to explore complex chromatin architectures and retrieve detailed cluster information. Its user-friendly features simplify the identification of genomic regions involved in higher-order interactions.

The source code is available here.

MAFcounter: an efficient tool for counting the occurrences of k-mers in MAF files | BMC Bioinformatics

MAFcounter efficiently extracts and counts k-mers from multiple sequence alignments in MAF format using a multithreaded, producer-consumer approach. Supporting both canonical and reverse-complement k-mer analysis, it delivers flexible output options for downstream comparative analyses. This tool is designed to handle large-scale genomic or proteomic datasets with speed and minimal memory demands.

The source code is available here.

OmniSegger: A time-lapse image analysis pipeline for bacterial cells | PLOS Computational Biology

OmniSegger is an automated pipeline for the analysis of time-lapse microscopy images of bacterial cells. It integrates robust segmentation algorithms to accommodate varying cell morphologies and diverse imaging modalities, enabling quantitative cell tracking and cytometry. The pipeline efficiently transforms raw microscopy data into comprehensive datasets that support dynamic bacterial analysis over time.

The source code is available here.

KILDA: identifying KIV-2 repeats from kmers | Oxford Academic

KILDA is a Nextflow pipeline that estimates the number of kringle IV type 2 repeats in the LPA gene directly from kmer counts in FASTQ files. It calculates repeat copy number by comparing counts from KIV-2-specific kmers against normalization regions. Validated on datasets such as the 1000 Genomes Project, the approach offers an alignment-free method for assessing repeat number variations associated with cardiovascular risk factors.


IEIVariantFilter: a bioinformatics tool to speed up genetic diagnosis of inborn errors of immunity patients | Oxford Academic

IEIVariantFilter is a web-based tool developed to streamline the analysis of whole-exome sequencing data for diagnosing inborn errors of immunity. It automates variant filtering by prioritizing entries based on zygosity, read quality, and predicted functional impact, particularly focusing on immune-related genes. Designed to be user-friendly for those without extensive bioinformatics expertise, it aims to reduce manual effort in variant prioritization.

The software is available here.

CellVoyager: AI CompBio Agent Generates New Insights by Autonomously Analyzing Biological Data | bioRxiv

The study introduces CellVoyager, an AI-driven computational tool engineered to autonomously analyze complex biological data, including imaging and omics datasets. It outlines an automated pipeline that integrates machine learning algorithms for extracting cellular features and uncovering patterns in data. The manuscript details the system’s design, workflow, and potential applications in cellular research without offering subjective evaluations.

The source code is available here.

ProtMamba: a homology-aware but alignment-free protein state space model | bioRxiv

This work presents ProtMamba, a protein language model built on a state space (Mamba) architecture that eschews traditional multiple sequence alignments. It employs a fill-in-the-middle training objective that combines autoregressive and masked language modeling, enabling the generation of novel protein sequences and prediction of protein fitness. The study systematically describes the model’s design, training strategy, and applications in protein design.

The source code is available here.

IGCN: Integrative Graph Convolution Networks for patient level insights and biomarker discovery in multi-omics integration | Oxford Academic

The paper details IGCN, a neural network framework that uses graph convolution networks to integrate multi-omics datasets for patient-level analyses. It focuses on capturing the differential emphasis of various omics data, enabling the identification of salient biomarkers, and supporting cancer subtype classification. The study provides a technical account of the network architecture, data integration strategy, and comparative assessments with existing methods without endorsing any specific approach.

The source code is available here.

📰 Community News

Elucidata, Sapien Biosciences Partner to Make Biobank Samples Accessible to AI | GenomeWeb Elucidata and Sapien Biosciences are collaborating to convert Sapien’s cancer biobank—comprising genomic, transcriptomic, and histopathology data from over 85,000 patients—into AI-ready datasets. Using Elucidata’s Polly platform, the partnership aims to standardize and enrich these data for applications in drug discovery and diagnostics, including synthetic data generation for diseases beyond oncology.

Gene editing can trigger inflammatory, senescence-like responses in blood stem cells | News Medical Life Sciences CRISPR-based gene editing in hematopoietic stem cells can induce inflammatory signaling and senescence-like states. These responses may affect cell viability and differentiation, raising considerations for therapeutic applications involving edited blood stem cells.

Researchers map DNA regions that activate liver regeneration | News Medical Life Sciences

Researchers from the University of Barcelona and collaborators mapped genome-wide regulatory DNA elements involved in liver regeneration using mouse models post-hepatectomy. They identified enhancers—both regeneration-specific and reactivated developmental ones—that orchestrate gene expression during regeneration. The study also found repression of enhancers linked to liver metabolic functions. These findings highlight parallels between liver regeneration and embryonic development, offering insights into chromatin dynamics and transcriptional control during tissue repair.

Early brain wiring holds clues to infant emotional development | News Medical Life Sciences Researchers used advanced MRI (NODDI) to study white matter microstructure in 3-month-old infants and found that early brain wiring predicts emotional development over the next six months. Higher neurite dispersion in the forceps minor was linked to increased negative emotionality, while complex structure in the left cingulum bundle correlated with positive emotions and better self-soothing. These findings suggest early neural architecture influences emotional trajectories and may help identify infants at risk for future emotional challenges.

New compound infuzide shows promise against resistant bacterial infection | News Medical Life Sciences

Researchers synthesized a new compound, infuzide, which demonstrated antimicrobial activity against resistant strains of Staphylococcus aureus and Enterococcus in lab and mouse models. Infuzide, a hydrazone-based molecule, acts via mechanisms distinct from existing antibiotics, potentially reducing resistance development. It outperformed vancomycin in reducing bacterial colonies and showed enhanced effects when combined with linezolid. The compound was synthesized using solvent-free methods, supporting scalable and eco-friendly production.

Male mice can grow ovaries if their pregnant mums are iron deficient | Nature A study found that iron deficiency during pregnancy in mice can disrupt fetal sex determination, leading genetically male (XY) embryos to develop ovaries instead of testes. The deficiency alters the expression of key sex-determining genes, including Sry, which is crucial for testis development. This shift results in the activation of ovarian pathways in male embryos. The findings suggest maternal nutrition can influence epigenetic regulation of fetal development, with potential implications for understanding disorders of sex development.

📅 Upcoming Events

A FAIR and scalable workflow for plankton phenogenomics at single cell level | EMBL-EBI Training

This training covers high-throughput single-cell approaches in marine sciences, focusing on integrating single-cell sequencing and imaging. It aims to create a scalable workflow for phenogenomics, ensuring data is FAIR. 

From Insight to Impact: A Conversation on the Strategic Role of Biomarkers in Drug Discovery and Clinical Trials | labroots

This discussion focuses on the integration of biomarkers in drug development and clinical trials. Industry experts explore how biomarkers can reduce risk, improve success rates, and accelerate timelines in drug discovery, emphasizing their strategic importance in clinical research.

📚 Educational Corner

Permutations and SHAPley values for feature importance in techtonique dot net’s API (with R + Python + the command line) | R-bloggers

The article explores two model-agnostic techniques—Permutation Importance and SHAP (SHapley Additive exPlanations) values—for assessing feature importance in machine learning models. It demonstrates how to use Techtonique.net’s API with R, Python, and command-line tools to apply these methods on classification and regression datasets. The process includes sending data via API requests, retrieving results, and visualizing feature contributions. Both global and local interpretability aspects are addressed through practical examples.

April 2025 Top 40 New CRAN Packages | R-bloggers

The article highlights 40 notable R packages released on CRAN in April 2025, spanning diverse domains such as archaeology, artificial intelligence, biology, chemistry, and climate science. Each package is briefly described with its core functionality, including tools for radiocarbon calibration, LLM-based data cleaning, circadian clock modeling, and atmospheric simulations. The post emphasizes practical applications and includes links to vignettes and documentation for further exploration.

Porting my favorite RStudio color theme to Positron | R-bloggers

This post details the process of adapting the "Tomorrow Night Bright" RStudio theme for use in Positron, a new IDE by Posit. It discusses differences in syntax highlighting and UI theming between the two environments. The author uses an R script from a companion package to demonstrate theme fidelity and highlights the use of rainbow parentheses and Open VSX extensions for customization.

Introducing Docker Hardened Images: Secure, Minimal, and Ready for Production | Docker

Docker introduces Hardened Images (DHI), a new class of secure-by-default container images designed for production environments. These images feature a significantly reduced attack surface and are continuously updated to minimize known vulnerabilities. They support major Linux distributions and integrate with popular DevOps and security tools, aiming to streamline secure software delivery.

How to make a multi-group dotplot for single-cell RNAseq data | Chatomics

Using a PBMC dataset with two experimental conditions, this tutorial demonstrates how to generate multi-group dotplots for single-cell RNA-seq data with Seurat and custom R code. In these plots, dot size reflects the percentage of cells expressing a given gene, while color intensity represents the average expression level of that gene within a cell type. The guide also covers steps for data normalization, integration, clustering, and visualization.

How to add boxplots or density plots side-by-side a scatterplot: a single cell case study | Chatomics

This tutorial uses a PBMC single-cell RNA-seq dataset to demonstrate how to enhance scatterplots with side-by-side boxplots or density plots using the ggside R package. It shows how to visualize gene expression relationships (e.g., CD14 vs. FCGR3A) across monocyte subtypes, enabling clearer interpretation of expression patterns. The guide includes data extraction from Seurat objects and customization of plot aesthetics.

Connect with Us

Stay connected and engage with us on social media for daily updates, discussions, and more!

📬 Subscribe

Don't miss an issue! Subscribe to the Bioinformer Weekly Roundup and receive the latest insights directly in your inbox.

Subscribe Now

We hope you enjoyed this week's edition of the Bioinformer Weekly Roundup. Feel free to share it with your colleagues and friends who share your passion for bioinformatics!


Disclaimer: The information provided in this newsletter is for educational and informational purposes only and does not constitute professional advice.

Contact: bioinformatics@zifornd.com

Copyright © 2025, Bioinformer Weekly Roundup. All rights reserved.



To view or add a comment, sign in

Others also viewed

Explore content categories