Tools for analysing fuzzy clusters of sequences data

BACKGROUND: Sequence analysis is a set of tools increasingly used in demography and other social sciences to analyse longitudinal categorical data.Typically, single (e.g., education trajectories) or multiple parallel temporal processes (e.g., work and family) are analysed by using crisp clustering algorithms that reduce complexity by partitioning c

read more

Building a best-in-class automated de-identification tool for electronic health records through ensemble learning

Summary: The presence of personally identifiable information (PII) in natural language portions of electronic health records (EHRs) constrains their broad reuse.Despite continuous improvements in automated detection of PII, residual identifiers require manual validation and correction.Here, we describe an automated de-identification system that emp

read more

Systematic expression analysis of ligand-receptor pairs reveals important cell-to-cell interactions inside glioma

Abstract Background Glioma is the most commonly diagnosed malignant and aggressive brain cancer in adults.Traditional researches mainly explored the expression profile of glioma at cell-population level, but ignored the heterogeneity and interactions of among glioma cells.Methods Here, we firstly analyzed the single-cell RNA-seq (scRNA-seq) data of

read more