🦌 inmoose: Multi-Omic Differential Expression in Python,🌱EDAmame: Exploratory Data Analysis, 🚀GOBoost: Predicting Protein Function with GO terms

🦌 inmoose: Multi-Omic Differential Expression in Python,🌱EDAmame: Exploratory Data Analysis, 🚀GOBoost: Predicting Protein Function with GO terms

Stay Updated with the Latest in Bioinformatics!

Issue: 92 | Date: 27 June 2025

👋 Welcome to the Bioinformer Weekly Roundup!

In this newsletter, we curate and bring you the most captivating stories, developments, and breakthroughs from the world of bioinformatics. Whether you are a seasoned researcher, a student, or simply curious about the intersection of biology and data science, we have got you covered. Subscribe now to stay ahead in the exciting realm of Bioinformatics!

🔬 Featured Research

Differential expression analysis with inmoose, the integrated multi-omic open-source environment in Python | BMC Bioinformatics

This study introduces InMoose, an open‐source Python framework that ported key R-based differential expression tools—limma, edgeR, and DESeq2—for the analysis of bulk transcriptomic data. The software replicates established results while streamlining interoperability between Python and R pipelines. InMoose also incorporates batch effect correction and quality control features, providing a complete multi-omic analysis environment.

Synthetic method of analogues for emerging infectious disease forecasting | PLOS Computational Biology

The authors present the Synthetic Method of Analogues (sMOA), which extends the traditional Method of Analogues by using a library of synthetically generated time series segments to forecast disease trends. This approach circumvents the historical data scarcity typical of emerging epidemics. The method achieves competitive forecasting accuracy and includes a tailored uncertainty quantification strategy for early outbreak scenarios.

Impact of symmetry in local learning rules on predictive neural representations and generalization in spatial navigation | PLOS Computational Biology

The study examines how temporal symmetry in local synaptic learning rules affects the formation of predictive neural representations, akin to successor representations used in spatial navigation. Using a two-layer network inspired by hippocampal circuitry, the authors find that symmetric rules can enhance generalization in symmetric environments. The work provides insights into the neural coding of spatial structure without endorsing any particular method.

Lateralised memory networks may explain the use of higher-order visual features in navigating insects | PLOS Computational Biology

Focusing on insect navigation, this research employs an anatomically constrained model of bilateral memory networks to show how complex visual features—such as the fractional position of mass—can emerge naturally. The study finds that balanced memory processing across hemispheres enables the implicit encoding of spatial cues, even with novel visual inputs. These observations shed light on emergent higher-order visual processing in insect brains.

Integrating differential expression under drought with gene family expansion unique to drought-tolerant species prioritizes candidate genes for drought adaptation in Brassicaceae species | BMC Genomics

By combining gene expression changes under drought with unique gene family expansions in drought-tolerant Brassicaceae species, the authors narrow down candidate genes involved in drought adaptation. The approach highlights enrichment in pathways related to root development and other stress-adaptive functions, and it detects signatures of diversifying selection in candidate families. This integrative strategy offers a focused list of targets for functional and crop-improvement studies.

β-Elemonic acid mediated enrichment of Paenibacillus to help Salvia miltiorrhiza Bunge alleviate drought stress | Microbiome

This study explores a plant–microbe interaction where β-elemonic acid, a triterpene acid secreted by Salvia miltiorrhiza roots, enriches beneficial Paenibacillus populations under drought conditions. Integrated amplicon sequencing, metabolomics, and physiological analyses reveal that this interaction improves water retention, nutrient uptake, and chlorophyll content, thereby enhancing drought resilience. The findings clarify a mechanistic link between plant metabolite secretion and microbial community shifts.

Identification of hydrogen oxidation coupled with antimonate reduction, a novel antimony biogeochemical cycling, in two contrasting antimony-contaminated environments | Microbiome

The research identifies a novel biogeochemical pathway wherein microbial hydrogen oxidation is directly coupled with antimonate reduction, potentially decreasing the mobility of antimony in environments affected by mining. DNA-stable isotope probing highlights taxa such as Azospirillum and Hydrogenophaga as key mediators in distinct habitats. This work underscores the role of microbial processes in altering heavy metal speciation and mitigating environmental contamination.

Gating and noelin clustering of native Ca2+-permeable AMPA receptors | Nature

This investigation elucidates the gating mechanisms and clustering behavior of native Ca²⁺-permeable AMPA receptors, with a focus on the modulatory role of the protein noelin. Advanced electrophysiological and imaging approaches reveal how receptor clustering influences channel properties and synaptic signaling. The study advances our understanding of the molecular organization underlying fast excitatory neurotransmission.

Decoding molecular mechanisms for loss-of-function variants in the human proteome | bioRxiv

Addressing loss-of-function mutations on a proteome-wide scale, this preprint decodes molecular mechanisms by which missense variants perturb protein stability and function. The study combines computational modeling with biochemical insights to link structural alterations with functional deficits. The analysis offers a systematic view of variant-induced destabilization, contributing to our understanding of genotype–phenotype relationships in genetic disorders.

🛠️ Latest Tools

scGT:  Integration algorithm for single-cell RNA-seq and ATAC-seq based on graph transformer | Oxford Academic

scGT is a Graph Transformer-based model for integrating single-cell RNA-seq and ATAC-seq data, enabling accurate label transfer and joint embedding. It constructs a hybrid graph using intra- and inter-dataset connections, refined through filtering, and applies self-attention to capture global and local relationships. Evaluated on paired, unpaired, and mismatched datasets, scGT outperforms existing methods in label accuracy and biological variation preservation.

The source code is available here.

NGPINT V3: A containerized orchestration Python software for discovery of next-generation protein–protein interactions | Oxford Academic

NGPINT V3 is a containerized pipeline for analyzing yeast two-hybrid NGS data to identify protein-protein interactions. It resolves prior compatibility issues using Docker and Singularity, enabling reproducible, cross-platform use. The tool supports any organism with a reference genome and integrates with Y2H-SCORES for downstream analysis.

The source code is available here. SeuratIntegrate: an R package to facilitate the use of integration methods with Seurat | Oxford Academic

SeuratIntegrate is an R package that enhances Seurat’s single-cell RNA-seq integration by supporting eight R and Python methods within R. It includes benchmarking tools, automated environment setup, and visualization features. A case study integrating immune cells from hepatocellular carcinoma datasets demonstrates its ability to reduce batch effects and preserve cell type distinctions across studies.

The source code is available here.

dbscATAC: a resource of single-cell super-enhancers/enhancers and gene markers derived from scATAC-seq data | Oxford Academic

dbscATAC is a single-cell database that annotates super-enhancers, typical enhancers, gene markers, and enhancer-gene interactions using scATAC-seq data across 13 species. It identifies over 213,000 super-enhancers and 10 million enhancer-gene links from 1.6 million cells. A case study demonstrates robust enhancer annotation across batches and projects, highlighting cell-type specificity. The platform includes analytic tools and a web interface for querying and visualizing regulatory elements at single-cell resolution.

The source code is available here.

EDAmame: interactive exploratory data analyses with explainable models | Oxford Academic

EDAmame is an R Shiny application for exploratory data analysis of tabular biological datasets, integrating data cleaning, visualization, correlation analysis, and machine learning modeling. It supports both supervised and unsupervised methods, with explainable AI tools for model interpretation. Using the WDBC dataset, EDAmame demonstrated high classification accuracy and feature explainability without requiring coding expertise. The tool is designed for accessibility and reproducibility in biomedical data exploration.

The source code is available here

AI-HOPE: An AI-Driven conversational agent for enhanced clinical and genomic data integration in precision medicine research | Oxford Academic

AI-HOPE is a locally deployed LLM-powered agent designed to integrate clinical and genomic data for precision medicine research via natural language queries. It supports association studies, survival analysis, and case-control comparisons without requiring programming. Demonstrated on TCGA data, AI-HOPE identified TP53 mutation enrichment in late-stage colorectal cancer and linked KRAS mutations to poorer survival in FOLFOX-treated patients. Its secure, scalable framework enables flexible, hypothesis-driven exploration across biomedical datasets.

The source code is available here.

spCLUE: a contrastive learning approach to unified spatial transcriptomics analysis across single-slice and multi-slice data | Genome Biology

spCLUE is a graph-based framework designed to analyse spatial transcriptomics data across both single and multi-slice samples. It uses multi-view contrastive learning to integrate spatial and gene expression information, while addressing batch effects through a dedicated module. The method supports clustering without requiring slice alignment and is evaluated across multiple platforms and tissue types

The source code is available here.

CellMemory: hierarchical interpretation of out-of-distribution cells using bottlenecked transformer | Genome Biology

CellMemory introduces a Transformer architecture inspired by global workspace theory to interpret out-of-distribution cells in single-cell omics. It uses a bottlenecked design with cross-attention to manage sparse data and infer cellular identities. The model enables gene-level interpretability and supports integration across species and modalities, with applications in characterizing malignant cell origins

The source code is available here.

GOBoost: Leveraging Long-Tail Gene Ontology Terms for Accurate Protein Function Prediction | Oxford Academic

GOBoost addresses the challenge of predicting protein functions associated with rare Gene Ontology terms. It introduces a global-local label graph and a multi-granularity focal loss to improve representation of long-tail labels. The method is benchmarked on PDB and AF2 datasets, showing enhanced performance across multiple functional categories

The source code is available here.

Fast and flexible minimizer digestion with digest | Oxford Academic

Digest is an open-source tool for efficient minimizer-based digestion of genomic sequences. It supports minimizers, modimizers, and syncmers, and is designed for scalability and integration into bioinformatics workflows. Implemented in C++ with Python and Rust bindings, it offers predictable spacing and multi-threaded performance for tasks like graph assembly and sequence classification

The source code is available here.

DrugTar Improves Druggability Prediction by Integrating Large Language Models and Gene Ontologies | Oxford Academic

DrugTar combines protein sequence embeddings from a pre-trained language model with gene ontology data to predict druggability. The model emphasizes the utility of sequence-based features and applies deep learning to enhance generalization. It is evaluated on benchmark datasets and made available via a web server and open-source repository.

The source code is available here.

Pathway Volcano: An interactive tool for pathway guided visualization of differential expression data | Oxford Academic

Pathway Volcano is an R-Shiny application that filters volcano plots by biological pathways using the Reactome API. It enables focused visualization of differential expression data by highlighting pathway-specific features. The tool supports interactive exploration and export of visual and tabular outputs

The source code is available here.

Inference of differential kinase interaction networks with KINference | Oxford Academic

KINference introduces a data-driven method for identifying differential kinase-substrate interactions between conditions using phosphoproteomics data. It constructs baseline networks and applies node and edge filters based on functional relevance and phosphorylation correlations. The approach is demonstrated on datasets involving kinase inhibition and viral infection

The source code is available here.

MitoTracer facilitates the identification of informative mitochondrial mutations for precise lineage reconstruction | PLOS Computational Biology

This research describes MitoTracer, an automated computational pipeline that detects informative mitochondrial mutations from single-cell RNA and ATAC sequencing data. By benchmarking against ground-truth experimental lineage data, the tool demonstrates high sensitivity and specificity. It offers a streamlined approach for reconstructing cellular clonal relationships, contributing to studies in development and cancer evolution.

The source code is available here.

📰 Community News

A critical link exists among high temps, aging and disease risk | Medical Xpress

Aging and extreme heat together weakens the gut and immune system, increasing susceptibility to infections like Vibrio vulnificus. UC Irvine researchers found that older mice exposed to heat showed more gut damage and immune dysfunction compared to younger mice. Probiotics like Roseburia intestinalis helped restore immune function in aged mice. This study highlights the need to support gut health to boost immune resilience during heat exposure.

Surviving breast cancer tied to lower Alzheimer's risk: Radiation therapy may offer short-term protection | Medical Xpress

Breast cancer survivors exhibit an 8% lower risk of Alzheimer's dementia (AD) compared to cancer-free individuals. Radiation therapy is significantly associated with this reduced risk, particularly in survivors aged 65 or older. The study analyzed national health insurance data of 70,701 breast cancer survivors and found that the lower AD risk diminishes over time, becoming indistinguishable from controls after five years. Researchers suggest integrating dementia risk management into survivorship care.

Global study links severe bleeding after childbirth to increased risk of cardiovascular disease | Medical Xpress

Severe bleeding after childbirth (postpartum hemorrhage) is linked to a higher risk of cardiovascular diseases, such as heart failure, stroke, and ischemic heart disease. A global study involving over 9.7 million women found that this increased risk can persist for up to 15 years. Researchers suggest that routine cardiovascular check-ups should be part of postpartum care for women who experience severe bleeding. The study emphasizes the importance of long-term maternal health beyond childbirth.

Can monoclonal antibodies effectively treat malaria? Scientists say the answer is a resounding 'yes' | Medical Xpress

Monoclonal antibodies (mAbs) have shown high efficacy in treating malaria, particularly targeting the Plasmodium falciparum parasite. Two mAbs, CIS43LS and L9LS, developed by U.S. researchers, bind to distinct regions on the parasite's surface, preventing it from entering the liver. Clinical trials have demonstrated that mAbs offer rapid protection and are safe for all ages. Researchers suggest that mAbs could be a crucial addition to malaria treatment strategies, especially in regions with high resistance to traditional methods.

New study locates neuron clusters that help the brain repay sleep debt | Medical Xpress

Neuron clusters in the thalamus called RE Vglut2 neurons help the brain repay sleep debt. Researchers found that these neurons are activated during sleep deprivation in mice, leading to deep, restorative sleep. The study used techniques like retrograde viral tracing and optogenetics to identify and manipulate these neurons. Findings suggest that understanding these neural circuits could aid in developing strategies for shift workers and those with sleep disorders.

Fusion genes found to be pivotal players in cancer development | News Medical Life Sciences

Researchers have identified fusion genes as central drivers in various cancers, highlighting their role in initiating and sustaining tumorigenesis. The study utilized large-scale genomic analyses to map fusion events across multiple cancer types, revealing recurrent patterns and associations with disease progression. These fusion genes often result from chromosomal rearrangements and can produce oncogenic proteins or deregulate gene expression. The findings suggest that fusion genes may serve as biomarkers for diagnosis and potential targets for precision therapies. The study emphasizes the need for further exploration of fusion-driven mechanisms in cancer biology.

Early infant behavior predicts cognitive ability decades later | News Medical Life Sciences

A longitudinal twin study by the University of Colorado Boulder found that cognitive assessments conducted as early as 7 months can predict cognitive performance at age 30. The research analyzed seven infant behavioral measures, with novelty preference and task orientation showing the strongest predictive value. Environmental factors in infancy were shown to significantly influence long-term cognitive outcomes. The study suggests early developmental signals may account for about 13% of adult cognitive variance.

LINE-1 elements found essential for early embryonic development | News Medical Life Sciences

Researchers have demonstrated that LINE-1 transposable elements, which comprise about 20% of the human genome, are crucial for early embryonic development. Contrary to their reputation as genomic parasites, LINE-1 elements were shown to promote developmental progression in human embryonic stem cells. Inhibition of LINE-1 expression caused cells to revert to a totipotent 8-cell stage, indicating their role in maintaining developmental trajectory. The study highlights the transcriptional activity of LINE-1 during early embryogenesis and its functional importance in cellular differentiation.

A Virus Arms its Bacterial Host with a Toxin, Helping Both to Thrive | The Scientist

A study reveals that a bacteriophage infecting Vibrio cholerae encodes a toxin that enhances the bacterium’s ability to outcompete neighboring microbes. The phage integrates into the bacterial genome and expresses a Type VI secretion system effector, which is lethal to rival bacteria but harmless to the host. This mutualistic relationship benefits both the virus and the bacterium, promoting survival and proliferation in competitive environments. The findings underscore the role of phage-mediated gene transfer in microbial ecology and host-pathogen dynamics.

Microbes in Baboon Poop Offer Clues into Aging | The Scientist

Researchers developed a microbiome clock using DNA from 14,000 fecal samples of 479 wild baboons, predicting age with a median error of two years. The study identified 1,440 microbial features that shift across life stages, with early and late life showing higher diversity. Socio-environmental factors like drought influenced microbiome age deviations from chronological age. The findings suggest gut microbiome composition reflects biological aging and may inform strategies for promoting healthy aging.

📅 Upcoming Events

Olink® Reveal: NGS-based high-plex proteomics reagent kit. Simplified for your lab | Olink 

This webinar will delve into how Olink Reveal simplifies proteomics research using Proximity Extension Assay (PEA) with NGS readout for broad protein coverage. It will explore the kit's sensitivity and reproducibility, measuring over 1,000 proteins for deep profiling. It will also explore on using the Olink Insight for selecting relevant protein panels and NPX Map for assessing data quality and interpreting QC metrics.

Exploring macromolecular protein assemblies and their visualisation with Complex Portal | EMBL-EBI 

This webinar explores on the Complex Portal, explaining how protein complexes are defined and how to search for them using accession numbers or keywords. It covers organism-specific searches, complex details pages, and the Complex Viewer. Additionally, it explores the new 'star-rating' system, the Complex Navigator for comparing complexes, and introduces predicted human complexes from huMAP3.0 and MuSIC maps. 

📚 Educational Corner

Shiny in Production 2025: Lightning Talk Lineup | R-bloggers  

The lightning talks at Shiny in Production 2025 showcase diverse applications of R Shiny in public health and safety. Highlights include AGES Austria’s dashboard for epidemiological surveillance across 76 disease categories and INRAE’s Rescuelog system for lifeguard activity monitoring. Talks emphasize real-time data visualization, user interactivity, and deployment challenges such as performance optimization and multi-user scalability. 

How to Build, Run, and Package AI Models Locally with Docker Model Runner | Docker   Docker Model Runner enables local execution and packaging of AI models without requiring Python environments or external APIs. It supports CLI and API-based interaction, model versioning, and integration with CI/CD pipelines. The tool facilitates privacy-preserving, low-latency inference and is compatible with models from Docker Hub and Hugging Face, targeting use cases like chatbots, medical imaging, and code generation.

BigOmics Meets Studios: End-to-end Exploration of RNA-Seq and Proteomics Data in Seqera Platform | Seqera 

BigOmics’ Omics Playground integrates with Seqera’s Studios to provide an interactive, reproducible environment for RNA-seq and proteomics analysis. The platform supports QC, differential expression, pathway enrichment, and biomarker discovery via no-code dashboards. It enables scalable data exploration and visualization, streamlining workflows from raw reads to publication-ready insights within Nextflow infrastructure.

Connect with Us

Stay connected and engage with us on social media for daily updates, discussions, and more!

📬 Subscribe

Don't miss an issue! Subscribe to the Bioinformer Weekly Roundup and receive the latest insights directly in your inbox.

Subscribe Now

We hope you enjoyed this week's edition of the Bioinformer Weekly Roundup. Feel free to share it with your colleagues and friends who share your passion for bioinformatics!


Disclaimer: The information provided in this newsletter is for educational and informational purposes only and does not constitute professional advice.

Contact: bioinformatics@zifornd.com

Copyright © 2025, Bioinformer Weekly Roundup. All rights reserved.



To view or add a comment, sign in

More articles by Zifo Bioinformatics

Others also viewed

Explore content categories