Comparative phylogenomic analysis of chelicerates points to gene families associated with long-term suppression of host detection

We investigated patterns of gene family evolution across ticks and other parasites. We used phylogenetic profiling and trait-association tests to identify gene families that may enable parasitic species to feed on hosts undetected for prolonged periods.

Published Jan 1, 2025
DOI: 10.57844/arcadia-4e3b-bbea
Additional assets:

Purpose

Ticks and other parasitic chelicerates feed on our skin undetected for long periods of time. This requires blocking typical host responses to parasitism, such as itch, pain, and inflammation. We want to understand the mechanistic bases of this detection-suppression trait so we can design better therapeutic strategies for chronic skin diseases.

Here, we used comparative phylogenomics and trait mapping across 40 chelicerates to identify gene families that predict the ability of a parasite to suppress host detection. Out of the gene families we found, the most promising are antimicrobial peptides, cystatin protease inhibitors, and SAA1-like proteins. These candidate detection suppressors have known links to immunity, itch, and pain, creating opportunities to better understand their function in parasitism and skin biology.

That said, our approach and results come with caveats, and they need follow-up. We're sharing our methods and findings in case they’re useful to others interested in predicting the biological basis of traits. As our own methodology is a work in progress, we'd welcome feedback on how to continue to build and improve our phylogenomics toolkit. We’d also love feedback on our specific findings from domain experts in fields such as immunology, skin biology, and parasitology.

Background and goals

Ectoparasites are parasites that live on the outside of their hosts. They feed by breaching the host’s outermost barrier defenses, using salivary secretions to manipulate the local environment, and releasing host fluids they then ingest [1]. Ectoparasites must evolve specialized structures and strategies to get past their host's barrier defenses [1], and many organisms (from plants to animals) have ectoparasites that have evolved to prey on them [2][3].

Hematophagous (blood-feeding) ectoparasites consume blood, lymph, or exudate leaking from damaged skin [1]. Within hematophagous ectoparasites of vertebrates such as humans, there's a wide diversity of feeding behaviors. Mosquitoes attach to the host for only seconds to minutes [4], while leeches or argasid (soft) ticks feed for 20–30 minutes [5][6]. Some hard tick species have even evolved the ability to feed for longer than a day [7]. Such extreme strategies require substantial manipulation of the host dermis through the tick’s saliva, which contains hundreds to thousands of pharmacologically active molecules [8].

Some of the host systems that tick saliva manipulates include host sensory perception (itch, pain) and immune responses [8][9][10][11]. We broadly term these responses “host detection,” as they're systems that rapidly detect parasites or other sources of danger. We're interested in discovering the strategies that ticks use to block this detection. We believe these adaptations may inform new therapies for treating skin conditions characterized by itch, pain, and inflammation. In particular, prolonged feeders like ticks could give us a unique lens into signaling events in the skin over a longer time span, giving us clues for treating more stubborn chronic or delayed-onset conditions.

So far, we’ve used an experimental approach to identify molecules of interest. We extract itch-suppressing compounds from the saliva of the A. americanum ticks and narrow the list of potential effectors via sub-fractionation. This strategy has already proved fruitful, and we’ve been following up on several promising hits. However, it also has certain limitations that we hope to overcome with our evolution-informed computational platform. First, we’re limited in how many species of ticks we can empirically study given the throughput bottleneck imposed by experimental work. Second, experiments rely on activity-specific assays, meaning we can only survey a tiny slice of bioactivity at a time. Together, these limitations limit a more systematic understanding of global host manipulation mechanisms that ticks use.

Here, we designed an orthogonal computational approach to identify putative detection-suppressing effectors across a wider range of species. This approach, combining existing and new methods developed in-house, leverages comparative evolutionary genomics to test hypotheses and screen for gene families that putatively contributed to the evolution of long-term suppression of host detection. Since individual genes evolve at distinct rates and though distinct combinations of evolutionary mechanisms (e.g., gene duplication, loss, and horizontal transfer), reconstructing gene family histories can be quite informative about the genes that contribute to the evolution of ecologically relevant traits, such as those associated with ectoparasitism. You can read more about how this works in [12]. Importantly, a key part of this predictive power comes from our ability to compare patterns across species at both the genomic and phenotypic levels.

Illustration of some chelicerates in this study, highlighting ticks and scabies mites as long-term vertebrate parasites.

How did some parasites evolve to evade host detection?

Some species in the subclass Acari feed on vertebrate hosts, including humans, for long periods of time (> 1 day). They suppress host detection (including itch, pain, and inflammation), which could be therapeutically useful. We leveraged the diversity of lifestyles across all chelicerates to find genes associated with this behavior. We found 87 such gene families and filtered down to 10 promising putative suppressors of host detection.

Ticks and their relatives in the subphylum Chelicerata are ripe for comparative investigation into the evolution of traits related to parasitic life histories. Chelicerates have substantial variation in their ecology and traits relating to parasitism, diet, and feeding time, suggesting a comparable diversity of mechanisms underlying the suppression of host detection. Parasitic life histories have been gained and lost repeatedly across the group, particularly within the sea spiders, mites, and ticks [13]. Parasitic lineages have also undergone numerous host-switches [13]. Individual chelicerate species parasitize a broad diversity of host species, ranging from microscopic invertebrates such as nematodes to sea anemones, other mites, insects, and all major vertebrate groups [13]. Host specificity is also highly variable [13]. For instance, tick species range from being highly specialized with salivary cocktails tailored to their hosts (a process potentially driven by gene duplication) to extreme generalists that feed on phylogenetically and morphologically diverse host species [14]. Furthermore, the duration spent feeding on their hosts is highly variable across chelicerates [6][7], suggesting the group may harbor substantial diversity in the types of pharmacologically active molecules suited to treating chronic skin diseases.

In this work, we searched for gene families (orthogroups) with diversification histories consistent with the hypothesis that they contributed to the evolution of long-term suppression of host detection. To do so, we designed and implemented a “phylogenomic association test,” which performs phylogenetic profiling using the outputs of NovelTree, our method for phylogenomic inference [15]. We identified 87 gene families with gene copy numbers that were strongly and significantly associated with long-term suppression of host detection. By filtering for patterns consistent with secreted salivary effectors, we narrowed this list to 10 orthogroups, four of which had clear functional annotations. These orthogroups are a starting point for us to make new hypotheses about how ticks modulate host responses like itch, pain, and inflammation.

Orthogroups vs. gene families

Throughout this pub, we use the terms orthogroups and gene families interchangeably. We use these words to refer to groups of evolutionarily related genes derived from a single gene present in the last common ancestor of all the species analyzed [16].

The approach

Our approach to identifying chelicerate proteins associated with host detection suppression consisted of four main steps: designing the study (including curating publicly available data), performing phylogenomic inference with NovelTree [15], testing for associations between gene family diversification and the capacity for long-term suppression of host detection, and filtering down to gene families most likely to be secreted salivary effectors (Figure 1). Click here to skip past our methodology and jump straight to the results.

Workflow diagram of the analysis steps in this study used to predict which orthogroups are suppressors of host detection.

Figure 1. Our workflow for identifying gene families that help long-term feeders suppress host detection.

We started with 40 chelicerate proteomes and then used NovelTree to identify gene families and reconstruct their evolutionary histories. We moved 7,529 gene families that met our criteria to the phylogenetic profiling stage, where we measured the association of orthogroups with the ability of a chelicerate to suppress host detection. Out of the 87 orthogroups positively associated with this trait, we filtered down to the 10 most promising that seem most likely to be secreted and directly interact with the host.

Curating chelicerate proteomes

To explore gene family evolution in ticks and other blood-feeding or parasitic species, we used our phylogenomic inference tool, NovelTree [15]. The NovelTree workflow takes proteomes (i.e., one amino acid sequence for each protein-coding gene in a genome or transcriptome) from diverse organisms and infers orthology, gene family trees, species trees, and gene family evolutionary dynamics. For this initial investigation into the evolution of host detection suppression by chelicerates, we decided to target 30–50 species.

Based on our interest in prolonged feeding, we focused on ticks and their close relatives. We compiled a set of 15 tick proteomes and built out taxonomic breadth and depth within the tick-containing subphylum Chelicerata. Ultimately, this dataset consisted of proteomes from 40 chelicerate species, including class Pycnogonida (sea spiders; n = 2), subclass Acari (a historical designation for mites and soft or hard ticks; n = 32), orders Opiliones (harvestmen; n = 2), Palpigradi (microwhip scorpions; n = 1), Ricinulei (hooded tick spiders; n = 1), Solifugae (sun spiders; n = 1), and Xiphosura (horseshoe crabs; n = 1). We selected these species groups based on recent chelicerate phylogenetic hypotheses and associated life history transitions (e.g., [17][18][19]).

We used 25 proteomes generated from genomic data and 15 from transcriptomes to study these species. To maximize proteome completeness, we only used transcriptomes derived from either whole animals or multiple tissues, except for the Ornithodoros turicata transcriptome, which came from only the tick synganglion. Most of the proteomes used in this analysis had > 75% of the Arachnida Odb10 BUSCOs [20] (Figure 2), meaning they're relatively complete despite their varied origins. As previously described in detail here (Snakemake workflow found here), before running NovelTree, we pre-processed all 40 proteomes to filter out redundant and short sequences and curate functional annotations (e.g., KEGG annotations) [15].

We ran the pre-processing Snakemake workflow using the sample sheet “chelicerate-samples.tsv.” This and all proteome processing outputs are on Zenodo (DOI: 10.5281/zenodo.14113178). Outputs of the proteome data curation pipeline are in “chelicerate_proteome_preprocessing_outputs.zip,” and all chelicerate protein sequences are found in “2024-06-24-all-chelicerate-noveltree-proteins.fasta.”

Species tree of the chelicerates in this study, showing proteome source and proteome completeness.

Figure 2. Proteome source and completeness for the 40 chelicerate species in our study.

Proteomes are organized into a species tree, which we inferred by SpeciesRax using 4,969 gene families as implemented in NovelTree. We show BUSCO completeness scores for each proteome, calculated using the Arachnida Odb10 dataset. Proteomes sourced from transcriptomes are highlighted in light grey, and proteome sources from genomes are marked in dark purple.

Curating chelicerate trait data

To predict the basis of long-term suppression of host detection in our trait association tests, we had to assign the presence or absence of that trait to each of our 40 chelicerate species. We established four ecological criteria that an organism would have to satisfy to be designated a long-term suppressor: (1) the organism is a parasite, (2) the organism feeds on vertebrates, (3) the organism must feed for > 1 day, and (4) the organism can’t be known to cause immediate itch, pain, or inflammation in a bitten host. We then manually curated the relevant trait data from the literature for each species included in our phylogenetic analyses (Table 1 & Table 2). We classified species reported to meet these criteria as long-term suppressors of host detection (Table 2).

Implications of omitting parasites of non-vertebrates

Many chelicerates are parasites of non-vertebrate species, such as plants and other arthropods. They may also suppress host detection for extended periods. However, given the overall limited data on exact feeding durations and the differences in the mechanisms underlying host recognition of ectoparasites across vertebrates, arthropods, and plants in the literature, we limited our search to adaptations that specifically enable parasites to feed on vertebrates. While this narrowed scope might mean missing out on the full mechanistic diversity of host detection suppression in chelicerates, it increases our ability to interpret our results in the context of known human biology.

SpeciesCommon nameDietHostDiet and lifestyle reference(s)Feeding duration (days)Feeding duration reference
Adenacarus sp.Long-legged mitesVarious small organisms, pollen, fungiNA[21]NANA
Amblyomma americanumLone star tick (hard tick)BloodVertebrate[22]12[23]
Amblyomma sculptumHard tickBloodVertebrate[22]14[24]
Blomia tropicalisDust miteSkin flakes (scavenged)NA[25]NANA
Dermacentor andersoniRocky Mountain wood tick (hard tick)BloodVertebrate[22]6[26]
Dermacentor silvarumHard tickBloodVertebrate[22]7[27]
Dermacentor variabilisAmerican dog tick (hard tick)BloodVertebrate[22]8[23]
Dermanyssus gallinaeRed poultry miteBloodVertebrate[28]0.04[29]
Dermatophagoides farinaeAmerican house dust miteSkin flakes (scavenged)NA[25]NANA
Dinothrombium tinctoriumRed velvet miteHemolymphArthropod[30]UnknownNA
Eukoenenia spelaeaMicrowhip scorpionScavenges detritus, cyanobacteriaNA[31][32]NANA
Euroglyphus mayneiMayne's house dust miteSkin flakes (scavenged)NA[33]NANA
Galendromus occidentalisWestern predatory miteHemolymphArthropod[34]UnknownNA
Galeodes spCamel spiderVarious small arthropodsNA[35]NANA
Haemaphysalis longicornisAsian long-horned tick (hard tick)BloodVertebrate[22]12[36]
Hyalomma asiaticumHard tickBloodVertebrate[22]10[37]
Ixodes persulcatusYaiga tick (hard tick)BloodVertebrate[22]9[38]
Ixodes ricinusCastor bean tick (hard tick)BloodVertebrate[22]7[39]
Ixodes scapularisBlack-legged tick (hard tick)BloodVertebrate[22]7[23]
Leptotrombidium delienseScrub typhus mite (chigger mite)Skin exudateVertebrate[40][41]3[40]
Limulus polyphemusAtlantic horseshoe crabSmall animals and detritusNA[42]NANA
Nymphon gracileSea spiderBody fluidsCnidaria[43][44]UnknownNA
Oppiella novaSoil miteFungiNA[45]NANA
Ornithodoros erraticusSoft tickBloodVertebrate[46]0.01[6]
Ornithodoros moubataAfrican relapsing fever tick (soft tick)BloodVertebrate[46]0.01[6]
Ornithodoros turicataRelapsing fever tick (soft tick)BloodVertebrate[46]0.01[6]
Ornithonyssus sylviarumNorthern fowl miteBloodVertebrate[47]0.04[29]
Phalangium opilioDaddy longlegs (harvestman)Various small arthropodsNA[48]NANA
Psoroptes ovisSheep scab miteSkin exudateVertebrate[49]27[50]
Pycnogonum litoraleSea spiderBody fluidsCnidaria[51]UnknownNA
Rhipicephalus microplusCattle tick (hard tick)BloodVertebrate[22]7[52]
Rhipicephalus sanguineusBrown dog tick (hard tick)BloodVertebrate[22]9[23]
Ricinoides atewaAtewa hooded spiderTermites and antsNA[53]NANA
Sarcoptes scabieiItch miteSkin exudateVertebrate[54]17[54]
Siro boyeraeMite harvestmanSmall animals and detritusNA[55]NANA
Tetranychus urticaeRed spider mitePlant cell cytoplasmPlant[56]UnknownNA
Tropilaelaps mercedesaeHoney bee miteHemolymphArthropod[57]2[57]
Tyrophagus putrescentiaeCheese miteFungiNA[58]NANA
Varroa destructorVarroa miteInsect fat bodiesArthropod[59]0.01[60]
Varroa jacobsoniVarroa miteInsect fat bodiesArthropod[60]0.01[60]

Table 1. Trait data for all chelicerate species we used in this study.

We report each species in this study's common name, whether it's a parasite, and its diet. For the parasites, we also report the duration of feeding time. In cases where we couldn't find the relevant trait information, we designate the trait as “unknown.” In the cases where the column isn't applicable (e.g., feeding duration doesn’t apply to non-parasites), we use the designation “NA.”

OrganismHost detectionDetection reference(s)Designation
Hard ticks (12 species)Suppress itch, pain, and inflammation during feeding, not detected by bitten humans[9][10][11][8]All are long-term detection suppressors
Leptotrombidium delienseBite is described as a “sting,” appears to be detected rapidly by bitten humans[41]Causes immediate pain, not a long-term detection suppressor
Psoroptes ovisCauses intense itch after massive replication, appears to be a latent phase before sheep begin itching[50]Long-term detection suppressor
Sarcoptes scabieiCauses intense itch after massive replication in the skin, not noticed by parasitized humans for up to 30 days[61][62]Long-term detection suppressor

Table 2. Host detection suppression in long-term parasites of vertebrates.

For the chelicerates in our study that parasitize vertebrates for > 1 day, we reviewed the literature to understand whether they're potential suppressors of host detection. We report the relevant host-detection traits and our final designation of whether the organism is a long-term suppressor of host detection. Here, we group all hard ticks together, as they're all long-term parasites of vertebrates and, to the best of our knowledge, share the ability to suppress host detection. In addition to hard ticks, we also predict that P. ovis and S. scabiei can suppress host detection for > 1 day.

Inferring gene family evolutionary histories with NovelTree

We ran NovelTree v1.0.2 using the same settings as described in its original pub [12]. Briefly, we used WITCH v0.3.0 [63] for multiple sequence alignment and IQ-TREE 2 v2.2.0.5 [64] for gene family tree inference. We required that gene families retained for phylogenetic analysis contained proteins from at least five species and had a mean per-species gene copy number ≤ 20 (7,529 gene families). After multiple sequence alignment, we removed sequences with an ungapped length of ≤ 20 amino acids. To infer our species tree, we used the gene families with fewer than a mean per-species copy number of 10 (4,969 gene families).

Run configuration and all NovelTree input and outputs are on Zenodo (DOI: 10.5281/zenodo.14113178).

  • The detailed NovelTree run configuration is in its parameter file “chelicerata-v1-parameterfile.json
  • Inputs are “chelicerata-v1-samplesheet.csv,” and outputs are “chelicerata-v1-10062023.zip” and “summary_of_noveltree_results.zip
  • You can find the orthogroup table in “chelicerata-v1-10062023.zip” with file path “chelicerata-v1-10062023/orthofinder/complete_dataset/Results_Inflation_1.5/Orthogroups/Orthogroups.GeneCount.tsv

Phylogenetic profiling

We sought to identify gene families most strongly associated with suppressing host detection. We first generated phylogenetic profiles — counts of evolutionary events for each species and each internal branch of the species tree — for each gene family using the outputs of GeneRax using R (v4.4.1) [65]. Note that our approach to phylogenetic profiling is distinct from the approach more typically applied to bacterial species, which uses gene presence/absence rather than gene copy number or event counts [66].

Here, we focus on speciation events. These occur when a parent species diverges into two daughter species, resulting in the original gene branching into two separate gene lineages that persist to the present. Speciation events thus capture multiple features of gene family evolution that are closely tied to gene copy number and correlate with the other event types — gene duplication, transfer, and loss.

Performing independent association tests for each (> 10,000) gene family carries the risk of elevated false-positive rates and the spurious identification of gene family–trait associations. Consequently, we took a hierarchical approach where we: 1) clustered gene families according to their profile similarity, 2) conducted statistical tests to identify which gene family clusters most strongly predicted suppression of host detection, and 3) carried out post-hoc tests using individual gene families within significant clusters. Below, we provide details of the methods used at each step.

If you want to use this phylogenetic profiling pipeline for your research, you can find it in our GitHub repo (DOI: 10.5281/zenodo.14218627).

Clustering

Phylogenetic profiles of gene family evolutionary event counts at each node in the tree are evolutionarily and thus statistically non-independent. To account for this, we conducted a phylogenetic transform on the log10(x + 1) normalized count data [67], effectively returning residual trait variation not explained by shared evolutionary history (i.e., phylogeny) alone. To facilitate the clustering of gene families, we subsequently quantified multivariate Mahalanobis distances between all gene families using the transformed count data. For more details on our implementation of this approach, see [68].

We combined these approaches to develop a pipeline to conduct phylogenetic profiling using the outputs of NovelTree, implemented by the function run_phylo_profiling. In brief, this function takes formatted event count data (speciation, duplication, transfer, loss, or all four) and the inferred species tree from NovelTree and subsequently:

  1. Applies the phylogenetic GLS transformation to the event count data
  2. Calculates all pairwise multivariate distances among gene families using all event count data
  3. Constructs a k-nearest neighbor (kNN) graph of gene families where k = 10 and edge weights are specified as similarities where similarity=exp(distance)similarity = exp(- distance) using dbscan [69] and igraph [70] after removing singleton (disconnected) components
  4. Clusters gene families using one or all of Leiden [71][72], Walktrap [73], or Infomap [74][75] clustering algorithms, optimizing for modularity
  5. Visualizes gene family clusters using t-distributed UMAP on the count data using the R package uwot [76], embedding gene families into a two-dimensional projection to facilitate visualization interactively with plotly [77]

For downstream trait association tests, we used the clusters inferred by the Leiden algorithm because it maximized modularity. Before performing our trait association tests for each cluster, we conservatively removed Leiden clusters with less than 25 orthogroups. We made this decision because the small sample sizes in these clusters led to noisy parameter estimates, possibly due to a greater sensitivity to model initialization conditions and low statistical power. We removed these clusters to prevent possible spurious associations with our trait of interest.

Phylogenomic trait association tests

As previously mentioned, we developed a pipeline to implement hierarchical association tests to identify which gene families have evolutionary histories most predictive of suppressing host detection. Using all clusters containing at least five gene families, the pipeline:

  1. Calculates the mean event count (after phylogenetic GLS transformation) for each gene family across species that suppress host detection and those that don't, grouping mean counts within each cluster.
  2. Conducts logistic regressions to identify clusters composed of gene families with event count distributions that most strongly differ between itch suppressors and non-suppressors (Figure 4, A)
  3. Conducts post-hoc phylogenetic logistic regressions using the phyloglm function from the phylolm R-package [78] for each gene family within the most strongly associated (i.e., p ≤ 0.05 & coefficient in top 10%) clusters, identifying gene families that positively predict whether species suppresses host detection (p ≤ 0.05, log-odds > 0: Figure 4, B–E)

We implement the per-cluster association tests in the cluster_assoc_test function, the post-hoc per-family tests in the test_fam_corrs_per_clust function, and plot results of these tests using the plot_cluster_traitcorr function.

We implement the entire pipeline on the chelicerate dataset in the “01_phylo_profiling_genefam_evol_counts.Rscript.

Orthogroup filtering

After conducting our hierarchical phylogenetic trait association tests, we chose to move forward with all orthogroups that were positively and significantly associated with suppression of host detection (i.e., p ≤ 0.05). We then sought to narrow this pool of orthogroups down to families of putative salivary effector proteins that could be direct mediators. We filtered for orthogroups where DeepSig (v1.2.5) [79] predicted at least 50 percent of the proteins have signal peptides and removed orthogroups containing proteins that the DeepTMHMM web server (v1.0.42) [80] predicted to have transmembrane domains.

Next, we wanted to filter down to orthogroups that are transcriptionally expressed in the salivary gland. To do this, we took advantage of our previous transcriptomic studies [81][82] of the Amblyomma americanum (lone star tick) salivary gland. Using our expression analysis [82] of our previously generated salivary gland transcriptome [81], we scored each gene in the A. americanum genome as “salivary expressed” or not. We then required that at least 25% of A. americanum genes in each orthogroup were “expressed" in our salivary transcriptome data. As a caveat, this means that we dropped orthogroups without an A. americanum representative from the analysis. In future iterations, we'd love to incorporate salivary gland expression data from many more chelicerates to make this analysis more robust and less biased toward the biology of a single species.

As our final filter, we also required that the orthogroup must have a representative gene from at least six out of 15 tick species represented in our data and that across these tick species, at least 10 genes were present in the orthogroup. We chose to do this because ticks are our focal species in this analysis, and we reasoned that strong suppressors of host detection would be shared across ticks.

We used the “02_clusters-orthogroups-analysis.Rscript to perform the filtering described in this section. The A. americanum salivary transcriptome expression table is on Zenodo in the file “tx2gene.tsv.”

Additional methods

We used ChatGPT to help write, clean up, and comment our code.

The results

Our trait association tests identify 10 candidate secreted salivary suppressors of host detection

To search for genes involved in the long-term suppression of itch, pain, and inflammation, we performed evolutionary comparisons across proteomes of 40 chelicerate species with varied traits involving parasitism and long-term suppression of host detection (Figure 3). The first step in this process was to define gene families (or orthogroups) using NovelTree [12]. NovelTree identified a total of 31,160 orthogroups, 7,529 of which comprised five or more species and had a maximum mean per-species copy number of 20. Of these, 4,969 had a mean per-species copy number of 10 and were therefore more likely to be phylogenetically informative, so we used these to infer species trees using SpeciesRax [83] as implemented in NovelTree.

Species tree of chelicerates in this study showing parasitism-related traits, including suppression of host detection.

Figure 3. Species tree of the 40 chelicerate species studied here, as well as their life history and trait data.

Input proteomes are organized by the species tree on the left, which we inferred by SpeciesRax using 4,969 gene families containing at least five species with a maximum mean per-species copy number of 10 as implemented in NovelTree. We curated trait data manually through literature review. To be designated a long-term suppressor of host detection, an organism had to be parasitic, feed on vertebrates, have a feeding time > 1 day, and not be known to cause itch, pain, or inflammation immediately after biting its host.

Next, we applied our phylogenetic profiling pipeline to these 7,529 orthogroups. We clustered these orthogroups into 71 Leiden clusters and found that 46 were positively and significantly associated with suppression of host detection after correcting for multiple tests (false-discovery rate/FDR ≤ 0.05; Figure 4, A). These clusters are sets of orthogroups with evolutionary histories that aren't just highly similar across chelicerates, but also closely track the evolution of the suppression of host detection. From these 46 clusters, we focused our downstream investigation on those for which the estimated coefficient was in the 90th percentile for positively associated clusters. This led us to retain five clusters comprising 832 orthogroups and 19,693 genes (mean size = 23.7 genes, standard deviation/SD = 38.1).

Ordered scatterplots showing the association of orthogroups with the detection-suppression trait.

Figure 4. Hierarchical phylogenomic trait-association tests identify 87 gene families (orthogroups) with speciation counts that are significantly (p ≤ 0.05) associated with the suppression of host detection.

(A) Trait-association results for each of the 71 Leiden clusters of orthogroups identified according to patterns of phylogenetic profile similarity (i.e., counts of speciations, duplications, transfers, and losses at each node of the species tree). Clusters are plotted along the x-axis, ordered by increasing sign and strength of association. Clusters are plotted according to their strength and the significance of their association, as indicated by the key.

(B–E) Trait-association results for each gene family within profile Leiden clusters 46 (B), 27 (C), 40 (D), and 33 (E), using a phylogenetic logistic regression. Each gene family is plotted along the x-axis, ordered by increasing sign and strength of association, and points are colored/shaped according to the same key in A. In each plot, we have labeled the orthogroups that are families of secreted effectors. No gene families in cluster 44 met our filtering criteria for secreted effectors; therefore, this cluster isn't shown here.

After conducting post-hoc association tests for each orthogroup within these five clusters, we identified 87 orthogroups that are positively and significantly associated with the suppression of host detection (p ≤ 0.05: Figure 4, B–E). It’s worth noting, however, that none of these orthogroups remained significant (FDR ≤ 0.05) after correcting for multiple tests. This is perhaps to be expected given the size of our dataset (only 40 species and two independent origins of long-term suppression of host detection) and the number of tests conducted (19,693 total post-hoc tests).

To identify the orthogroups most likely to contain detection-suppressing effector proteins secreted into the host skin, we identified orthogroups that we predicted would be expressed in the salivary gland and secreted into the saliva. We also filtered out orthogroups that were very small (< 10 gene copies) or present in less than six out of 15 tick genomes, since we figured that potent suppressors of host detection would be widespread in ticks. After all these filtering steps, we ended up with 10 orthogroups of secreted effectors putatively involved with suppressing host detection (Table 3). One cluster, cluster 44, didn't have any orthogroups that met our criteria, so we dropped it from the analysis at this point.

OrthogroupCluster# of genes in orthogroupPutative protein functionslog-oddsp-value
OG000074746102Unknown0.3410.017
OG00019704659Annotated as serum amyloid A protein
[BLAST and Foldseek confirm]
0.6400.012
OG00088494616Unknown2.1290.035
OG00096264613Unknown5.6640.002
OG00099054612Unknown2.4780.014
OG00052782739Annotated as fetuin-B; cystatin-SN; kininogen; cystatin-10
[BLAST and Foldseek confirm]
0.9600.048
OG00017744062Annotated as defensins, drosomycins (AMPs)
[BLAST and Foldseek confirm]
1.0580.031
OG00077744022Foldseek hits to basic tail secreted proteins (function unknown)1.8270.027
OG00008803393BLAST hits to acanthoscurrin-2-like proteins (AMPs)0.3140.012
OG00075163324Unknown0.9720.019

Table 3. Secreted orthogroups implicated in host detection suppression.

Each orthogroup is listed along with the original Leiden cluster from which it derived (Figure 4, A). We show the total number of genes in each orthogroup, the log-odds coefficient, and the p-value (not FDR-corrected) for each orthogroup’s association with the detection-suppression trait. To functionally annotate these orthogroups, we used common KEGG annotations found in this orthogroup, BLAST, and Foldseek. Of these 10 orthogroups, four had interpretable functional annotations across the orthogroup: OG0001970, OG0005278, OG0001774, and OG0000880. AMP stands for antimicrobial peptide.

SHOW ME THE DATA: Access the input proteomes, NovelTree inputs and outputs, and Amblyomma americanum gene expression data from this study on Zenodo. You can find our phylogenetic profiling methods and results on GitHub.

Exploring the candidate host-detection suppressors with annotations

Next, we’ll walk through the four orthogroups with clear annotations, discuss what we know about them, and speculate on how they may be involved in suppressing detection.

Serum amyloid A1-like proteins

OG0001970 comprises serum amyloid A1 (SAA1)-like proteins, which are highly conserved in vertebrates and involved in the inflammatory immune response [84]. To our knowledge, SAA1-like proteins haven't been characterized in ticks, so our functional predictions of the tick SAA1-like proteins are derived from our understanding of SAA1 in vertebrates.

In humans, SAA1 is a well-known inflammatory biomarker, increasing over 1000-fold in serum during a systemic inflammatory response [84]. Despite its strong correlation with inflammation, the functional role of SAA1 during a systemic inflammatory response is still poorly understood. In recent years, progress has been made in unraveling the role of SAA1 produced locally at tissue barrier sites. In the gut, SAA1 is implicated in generating inflammatory Th17 cells [85] and has been proposed as a potential drug target in irritable bowel disease [86]. In the respiratory tract, SAA1 acts as a soluble pattern recognition receptor that binds lipocalin proteins from house dust mites, triggering an allergenic type II immune response [87]. It's worth noting that house dust mites are also chelicerates, and lipocalins are a major class of secreted tick salivary effector proteins [88][89][90]. These findings make us wonder if human SAA1 is involved in inflammation and immune responses in the skin, and whether it can also recognize tick lipocalins.

Thus, the role of the vertebrate SAA1 in chelicerate recognition, allergy, and inflammation suggests a potential role for the tick SAA1 in modulating the host immune response during tick feeding.

Cystatin proteins

OG0005278 is a family of cystatin proteins. The cystatin superfamily of proteins canonically functions to inhibit cysteine proteases [91]. However, some subfamilies of cystatins have roles outside of protease inhibition. For example, the kinogen family has non-inhibitory alternative functions, including serving as precursor proteins for the production of kinin peptides [92]. Fetuins, another subfamily of cystatins, are circulating proteins involved with immune regulation that have been shown to bind free fatty acids [93] as well as other circulating signaling molecules [94]. Most characterized tick cystatins appear to act conventionally as cysteine protease inhibitors [95]. Excitingly, many tick cystatins have anti-inflammatory effects [95][96]. In humans, various cysteine proteases are involved in itch [97], pain [98], and inflammation [99][100], making inhibition of cysteine protease activity a plausible mechanism for ticks to suppress host detection.

Antimicrobial peptides (AMPs)

OG0000880 and OG0001774 are both annotated as antimicrobial peptides. OG0000880 is an orthogroup of acanthoscurrin-2-like proteins [101], and OG0001774 is a group of defensin- or drosmycin-like proteins. Defensins are a broad category of AMPs distributed across eukaryotes [102], and drosomycin is an insect defensin with antifungal activity [103][104]. It’s possible that these peptides act canonically as antimicrobials, protecting ticks or the host from infection. This could enable longer feeding time by either directly protecting the tick from pathogenic colonization, or by preventing skin infection and subsequent inflammation in the host. However, defensins can have many noncanonical roles outside of antimicrobial defense [102], and some tick defensin-like proteins are known to act directly on host mast cells, where they can drive itch and inflammation in the absence of other salivary components [105].

While the potential of tick peptides to modulate immune cell activity is fascinating, it’s difficult to hypothesize more about the activity of these particular peptide orthogroups as we don’t know their target cell type (be it bacterial, fungal, or host immune cells).

Key takeaways

Using a combination of phylogenetic profiling, phylogenetic trait-association tests, and protein sequence features, we identified 10 gene families in chelicerates that may contribute to suppressing host detection. Of these, we could confidently assign a function to four using KEGG annotations and structural similarity. Two families are predicted to be AMPs, one is a family of cysteine protease inhibitors called cystatins, and the last family is of SAA1-like proteins. Promisingly, SAA1 is best known as an inflammation-associated protein in humans, and tick cystatins have been shown to have anti-inflammatory functions. Our success in finding two gene families with clear, intuitive ties to inflammation points to the potential viability of our approach.

Caveats

We're excited about the potential to use these kinds of phylogenetic approaches to identify gene families with human biology applications, and we hope this pub provides a framework for how others might do this. That being said, our approach comes with some caveats. First, our trait-mapping approach depends on our ability to properly label species as detection-suppressing or not, which is hard to do empirically or systematically with publicly available data. Also, although we got interesting hits, none of the individual orthogroups associated with the host detection suppression trait remained significant after correcting for multiple tests. We expect that increasing the size of our dataset to include more independent evolutionary origins of long-term suppression of host detection could increase our statistical power.

Next steps

At the end of the day, we see this work as an exciting strategy for generating hypotheses. We’ll be following up on some of our hits computationally to see if we can get more confidence about how they might act to modulate human skin biology.

Although the approach we’ve developed here has proven useful to us in charting a path forward, much is left to be done. For instance, we’d like to improve the statistical power of our gene family association tests. As implemented, our gene family association tests are quite conservative, as no individual gene family remained a strong predictor of host-detection suppression after correcting for multiple tests. Because we carried out thousands of individual association tests, the bar for individual gene families to be associated with a trait of interest is necessarily very high to mitigate the risk of identifying false positives.

We'd also like to implement recently described complementary approaches that account for evolutionary non-independence by borrowing from the field of statistical genetics [106]. These have shown promise and were recently applied to study the contribution of gene family diversification to the evolution of metabolic traits in yeasts [107]. To increase their utility for the broader scientific community, we aim to generalize these approaches into a reproducible package of standalone functions for any run of NovelTree.

Finally, since the statistical approaches we describe in this pub are still in development, we'd love your feedback on improving them.


Provide feedback

Social3

1
Yu Z, Zheng H, Chen Z, Zheng B, Ma H, Liu J. (2010). The life cycle and biological characteristics of Dermacentor silvarum Olenev (Acari: Ixodidae) under field conditions. https://doi.org/10.1016/j.vetpar.2009.11.010
2
Nelson WA, Keirans JE, Bell JF, Clifford CM. (1975). Review Article:1Host-Ectoparasite Relationships. https://doi.org/10.1093/jmedent/12.2.143
3
Pritchard J, Kuster T, Sparagano O, Tomley F. (2015). Understanding the biology and control of the poultry red miteDermanyssus gallinae: a review. https://doi.org/10.1080/03079457.2015.1030589
4
HOPLA C, DURDEN L, KEIRANS J. (1994). Ectoparasites and classification. https://doi.org/10.20506/rst.13.4.815
5
Oliver JH. (1965). Karyotypes and Sex Determination in Some Dermanyssid Mites (Acarina: Mesostigmata)1. https://doi.org/10.1093/aesa/58.4.567
6
Powell G, Tosh CR, Hardie J. (2006). HOST PLANT SELECTION BY APHIDS: Behavioral, Evolutionary, and Applied Perspectives. https://doi.org/10.1146/annurev.ento.51.110104.151107
7
Zhang Z-Q. (1998). Review Biology and ecology of trombidiid mites (Acari: Trombidioidea). https://doi.org/10.1023/a:1006002028430
8
Martin-Martin I, Williams AE, Calvo E. (2023). Determination of Mosquito Probing and Feeding Time to Evaluate Mosquito Blood Feeding. https://doi.org/10.1101/pdb.top107659
9
Smrž J, Kováč Ĺ, Mikeš J, Lukešová A. (2013). Microwhip Scorpions (Palpigradi) Feed on Heterotrophic Cyanobacteria in Slovak Caves – A Curiosity among Arachnida. https://doi.org/10.1371/journal.pone.0075989
10
Lent CM, Fliegner KH, Freedman E, Dickinson MH. (1988). Ingestive Behaviour and Physiology of the Medicinal Leech. https://doi.org/10.1242/jeb.137.1.513
11
Balestra V, Lana E, Vanin S. (2022). Observations on the habitat and feeding behaviour of the hypogean genus Eukoenenia (Palpigradi, Eukoeneniidae) in the Western Italian Alps. https://doi.org/10.3897/subtbiol.42.75784
12
Berenger J-M, Parola P. (2017). Arthropod Vectors of Medical Importance. https://doi.org/10.1016/b978-0-7020-6285-8.00012-5
13
Colloff MJ. (1991). A review of biology and allergenicity of the house-dust miteEuroglyphus maynei (Acari: Pyroglyphidae). https://doi.org/10.1007/bf01246090
15
McMurtry JA, Croft BA. (1997). LIFE-STYLES OF PHYTOSEIID MITES AND THEIR ROLES IN BIOLOGICAL CONTROL. https://doi.org/10.1146/annurev.ento.42.1.291
16
Nuttall PA. (2019). Wonders of tick saliva. https://doi.org/10.1016/j.ttbdis.2018.11.005
17
Hebets EA, Oviedo-Diego M, Cargnelutti F, Bollatti F, Calbacho-Rosa L, Mattoni CI, Olivero P, Simian C, Abregú D, Vrech DE, Peretti AV. (2023). A scientist’s guide to Solifugae: how solifuges could advance research in ecology, evolution, and behaviour. https://doi.org/10.1093/zoolinnean/zlad174
18
Aounallah H, Bensaoud C, M’ghirbi Y, Faria F, Chmelar̆ J, Kotsyfakis M. (2020). Tick Salivary Compounds for Targeted Immunomodulatory Therapy. https://doi.org/10.3389/fimmu.2020.583845
19
Marendy D, Baker K, Emery D, Rolls P, Stutchbury R. (2020). Haemaphysalis longicornis: the life-cycle on dogs and cattle, with confirmation of its vector status for Theileria orientalis in Australia. https://doi.org/10.1016/j.vpoa.2019.100022
20
Šimo L, Kazimirova M, Richardson J, Bonnet SI. (2017). The Essential Role of Tick Salivary Glands and Saliva in Tick Feeding and Pathogen Transmission. https://doi.org/10.3389/fcimb.2017.00281
21
Chen Z, Yu Z, Yang X, Zheng H, Liu J. (2009). The life cycle of Hyalomma asiaticum kozlovi Olenev, 1931 (Acari: Ixodidae) under laboratory conditions. https://doi.org/10.1016/j.vetpar.2008.10.028
22
Kazimírová M, Štibrániová I. (2013). Tick salivary compounds: their role in modulation of host defences and pathogen transmission. https://doi.org/10.3389/fcimb.2013.00043
23
Ioffe-Uspensky I, Uspensky I. (2017). The influence of preprandial insemination on feeding and oviposition of Ixodes persulcatus females (Acari: Ixodidae) and some thoughts concerning mating strategies in ticks of the genus Ixodes. https://doi.org/10.1016/j.ttbdis.2017.07.003
24
Celebi FM, Chou S, McGeever E, Patton AH, York R. (2023). NovelTree: Highly parallelized phylogenomic inference. https://doi.org/10.57844/arcadia-z08x-v798
25
Militzer N, Bartel A, Clausen P-H, Hoffmann-Köhler P, Nijhof AM. (2021). Artificial Feeding of All Consecutive Life Stages of Ixodes ricinus. https://doi.org/10.3390/vaccines9040385
26
Dunlop JA. (2021). Chelicerates as Parasites. https://doi.org/10.1007/978-3-030-42484-8_9
27
Traub R, Wisseman CL. (1974). Review Article*: The Ecology of Chigger-Borne Rickettsiosis (Scrub Typhus)1, 2. https://doi.org/10.1093/jmedent/11.3.237
28
McCoy KD, Léger E, Dietrich M. (2013). Host specialization in ticks and transmission of tick-borne diseases: a review. https://doi.org/10.3389/fcimb.2013.00057
29
Lv Y, Guo X-G, Jin D-C. (2018). Research Progress on Leptotrombidium deliense. https://doi.org/10.3347/kjp.2018.56.4.313
30
Celebi FM, Chou S, McDaniel EA, Reiter T, Weiss EC. (2024). Predicted genes from the Amblyomma americanum draft genome assembly. https://doi.org/10.57844/arcadia-9602-3351
31
Walls EA, Berkson J, Smith SA. (2002). The Horseshoe Crab, Limulus polyphemus: 200 Million Years of Existence, 100 Years of Study. https://doi.org/10.1080/20026491051677
32
Emms DM, Kelly S. (2015). OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. https://doi.org/10.1186/s13059-015-0721-2
33
Dietz L, Dömel JS, Leese F, Lehmann T, Melzer RR. (2018). Feeding ecology in sea spiders (Arthropoda: Pycnogonida): what do we know?. https://doi.org/10.1186/s12983-018-0250-4
34
Lozano-Fernandez J, Tanner AR, Giacomelli M, Carton R, Vinther J, Edgecombe GD, Pisani D. (2019). Increasing species sampling in chelicerate genomic-scale datasets provides support for monophyly of Acari and Arachnida. https://doi.org/10.1038/s41467-019-10244-7
35
STOCK JH. (1978). Experiments on food preference and chemical sense in Pycnogonida. https://doi.org/10.1111/j.1096-3642.1978.tb02090.x
36
Giribet G, Edgecombe GD. (2019). The Phylogeny and Evolutionary History of Arthropods. https://doi.org/10.1016/j.cub.2019.04.057
37
Estrada-Pena A, Mangold A, Nava S, Venzal J, Labruna M, Guglielmone A. (2010). A review of the systematics of the tick family Argasidae (Ixodida). https://doi.org/10.1051/acarologia/20101975
38
Ballesteros JA, Sharma PP. (2019). A Critical Appraisal of the Placement of Xiphosura (Chelicerata) with Account of Known Sources of Phylogenetic Error. https://doi.org/10.1093/sysbio/syz011
39
Murillo AC, Mullens BA. (2017). A review of the biology, ecology, and control of the northern fowl mite, Ornithonyssus sylviarum (Acari: Macronyssidae). https://doi.org/10.1016/j.vetpar.2017.09.002
40
Manni M, Berkeley MR, Seppey M, Simão FA, Zdobnov EM. (2021). BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. https://doi.org/10.1093/molbev/msab199
41
Allard CM, Yeargan KV. (2005). Effect of Diet on Development and Reproduction of the HarvestmanPhalangium opilio(Opiliones: Phalangiidae). https://doi.org/10.1603/0046-225x-34.1.6
42
Rafferty DE, Gray JS. (1987). The Feeding Behaviour of Psoroptes spp. Mites on Rabbits and Sheep. https://doi.org/10.2307/3282508
43
Moraza ML, Prieto CE, Balanzategui I. (2021). A new species of the genus Opilioacarus With, 1902 (Acari: Opilioacarida) for the Iberian Peninsula. https://doi.org/10.24349/acarologia/20214422
44
Guglielmone AA, Robbins RG. (2018). Hard Ticks (Acari: Ixodida: Ixodidae) Parasitizing Humans. https://doi.org/10.1007/978-3-319-95552-0
45
van den Broek A, Huntley J. (2003). Sheep Scab: the Disease, Pathogenesis and Control. https://doi.org/10.1053/jcpa.2002.0627
46
Troughton DR, Levin ML. (2007). Life Cycles of Seven Ixodid Tick Species (Acari: Ixodidae) Under Standardized Laboratory Conditions. https://doi.org/10.1093/jmedent/44.5.732
47
Wilhelm E, Bückmann D, Tomaschko K-H. (1997). Life cycle and population dynamics of Pycnogonum litorale (Pycnogonida) in a natural habitat. https://doi.org/10.1007/s002270050202
48
Wong SF, Chong AL, Mak JW, Tan J, Ling SJ, Ho TM. (2011). Molecular identification of house dust mites and storage mites. https://doi.org/10.1007/s10493-011-9460-6
49
Senbill H, Hazarika LK, Baruah A, Borah DK, Bhattacharyya B, Rahman S. (2018). Life cycle of the southern cattle tick, Rhipicephalus (Boophilus) microplus Canestrini 1888 (Acari: Ixodidae) under laboratory conditions. https://doi.org/10.11158/saa.23.6.12
50
Howarth J, Hokama Y. (1983). Artificial Feeding of Adult and Nymphal Dermacentor Andersoni (Acari: Ixodidae) During Studies on Bovine Anaplasmosis1. https://doi.org/10.1093/jmedent/20.3.248
51
Cooke JAL. (1967). Obsevations on the biology of Ricinulei (Arachnida) with descriptions of two new species of Cryptocellus. https://doi.org/10.1111/j.1469-7998.1967.tb02864.x
52
Arlian LG, Morgan MS. (2017). A review of Sarcoptes scabiei: past, present and future. https://doi.org/10.1186/s13071-017-2234-1
53
de Bivort BL, Clouse RM, Giribet G. (2012). A cladistic reconstruction of the ancestral mite harvestman (Arachnida, Opiliones, Cyphophthalmi): portrait of a Paleozoic detritivore. https://doi.org/10.1111/j.1096-0031.2012.00407.x
54
Bensoussan N, Santamaria ME, Zhurov V, Diaz I, Grbić M, Grbić V. (2016). Plant-Herbivore Interaction: Dissection of the Cellular Pattern of Tetranychus urticae Feeding on the Host Plant. https://doi.org/10.3389/fpls.2016.01105
55
de Guzman LI, Williams GR, Khongphinitbunjong K, Chantawannakul P. (2017). Ecology, Life History, and Management of Tropilaelaps Mites. https://doi.org/10.1093/jee/tow304
56
Smrž J, Soukalová H, Čatská V, Hubert J. (2016). Feeding Patterns ofTyrophagus putrescentiae(Sarcoptiformes: Acaridae) Indicate That Mycophagy Is Not a Single and Homogeneous Category of Nutritional Biology. https://doi.org/10.1093/jisesa/iew070
57
Ramsey SD, Ochoa R, Bauchan G, Gulbronson C, Mowery JD, Cohen A, Lim D, Joklik J, Cicero JM, Ellis JD, Hawthorne D, vanEngelsdorp D. (2019). Varroa destructorfeeds primarily on honey bee fat body tissue and not hemolymph. https://doi.org/10.1073/pnas.1818371116
58
Li AY, Cook SC, Sonenshine DE, Posada-Florez F, Noble NI, Mowery J, Gulbronson CJ, Bauchan GR. (2019). Insights into the feeding behaviors and biomechanics of Varroa destructor mites on honey bee pupae using electropenetrography and histology. https://doi.org/10.1016/j.jinsphys.2019.103950
59
Bhat SA, Mounsey KE, Liu X, Walton SF. (2017). Host immune responses to the itch mite, Sarcoptes scabiei, in humans. https://doi.org/10.1186/s13071-017-2320-4
60
Mellanby K. (1944). The development of symptoms, parasitic infection and immunity in human scabies. https://doi.org/10.1017/s0031182000021612
61
Shen C, Park M, Warnow T. (2022). WITCH: Improved Multiple Sequence Alignment Through Weighted Consensus Hidden Markov Model Alignment. https://doi.org/10.1089/cmb.2021.0585
62
Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. (2020). Corrigendum to: IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. https://doi.org/10.1093/molbev/msaa131
63
Morel B, Kozlov AM, Stamatakis A, Szöllősi GJ. (2020). GeneRax: A Tool for Species-Tree-Aware Maximum Likelihood-Based Gene Family Tree Inference under Gene Duplication, Transfer, and Loss. https://doi.org/10.1093/molbev/msaa141
64
Date SV, Peregrín-Alvarez JM. (2008). Phylogenetic Profiling. https://doi.org/10.1007/978-1-60327-429-6_9
65
Butler MA, Schoener TW, Losos JB. (2000). THE RELATIONSHIP BETWEEN SEXUAL SIZE DIMORPHISM AND HABITAT USE IN GREATER ANTILLEANANOLISLIZARDS. https://doi.org/10.1111/j.0014-3820.2000.tb00026.x
66
Avasthi P, McGeever E, Patton AH, York R. (2024). Leveraging evolution to identify novel organismal models of human biology. https://doi.org/10.57844/arcadia-33b4-4dc5
67
Hahsler M, Piekenbrock M, Doran D. (2019). <b>dbscan</b>: Fast Density-Based Clustering with R. https://doi.org/10.18637/jss.v091.i01
68
Csárdi G, Nepusz T, Traag V, Horvát S, Zanini F, Noom D, Müller K, Schoch D, Salmon M. (2006). igraph: Network Analysis and Visualization. https://doi.org/10.32614/cran.package.igraph
69
Traag VA, Waltman L, van Eck NJ. (2019). From Louvain to Leiden: guaranteeing well-connected communities. https://doi.org/10.1038/s41598-019-41695-z
70
Clauset A, Newman MEJ, Moore C. (2004). Finding community structure in very large networks. https://doi.org/10.1103/physreve.70.066111
71
Pons P, Latapy M. (2005). Computing communities in large networks using random walks (long version). https://doi.org/10.48550/arxiv.physics/0512106
72
Rosvall M, Bergstrom CT. (2008). Maps of random walks on complex networks reveal community structure. https://doi.org/10.1073/pnas.0706851105
73
Rosvall M, Axelsson D, Bergstrom CT. (2009). The map equation. https://doi.org/10.1140/epjst/e2010-01179-1
74
McInnes L, Healy J, Melville J. (2018). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. https://doi.org/10.48550/arxiv.1802.03426
75
Sievert C. (2020). Interactive Web-Based Data Visualization with R, plotly, and shiny. https://doi.org/10.1201/9780429447273
76
Tung Ho Ls, Ané C. (2014). A Linear-Time Algorithm for Gaussian and Non-Gaussian Trait Evolution Models. https://doi.org/10.1093/sysbio/syu005
77
Savojardo C, Martelli PL, Fariselli P, Casadio R. (2017). DeepSig: deep learning improves signal peptide detection in proteins. https://doi.org/10.1093/bioinformatics/btx818
78
Hallgren J, Tsirigos KD, Pedersen MD, Almagro Armenteros JJ, Marcatili P, Nielsen H, Krogh A, Winther O. (2022). DeepTMHMM predicts alpha and beta transmembrane proteins using deep neural networks. https://doi.org/10.1101/2022.04.08.487609
79
Chou S, Hochstrasser ML, Poskanzer KE, Thuy-Boun PS. (2022). Robust long-read saliva transcriptome and proteome from the lone star tick, Amblyomma americanum. https://doi.org/10.57844/arcadia-3hyh-3h83
80
Borges AL, Reiter T, Weiss EC. (2025). An interactive visualization tool for Amblyomma americanum differential expression data. https://doi.org/10.57844/arcadia-6833-4117
81
Morel B, Schade P, Lutteropp S, Williams TA, Szöllősi GJ, Stamatakis A. (2022). SpeciesRax: A Tool for Maximum Likelihood Species Tree Inference from Gene Family Trees under Duplication, Transfer, and Loss. https://doi.org/10.1093/molbev/msab365
82
Sack GH. (2018). Serum amyloid A – a review. https://doi.org/10.1186/s10020-018-0047-0
83
Sano T, Huang W, Hall J, Yang Y, Chen A, Gavzy S, Lee J-Y, Ziel JW, Miraldi E, Domingos A, Bonneau R, Littman D. (2015). An IL-23R/IL-22 Circuit Regulates Epithelial Serum Amyloid A to Promote Local Effector Th17 Responses. https://doi.org/10.1016/j.cell.2015.08.061
84
Chen R, Chen Q, Zheng J, Zeng Z, Chen M, Li L, Zhang S. (2023). Serum amyloid protein A in inflammatory bowel disease: from bench to bedside. https://doi.org/10.1038/s41420-023-01455-5
85
Smole U, Gour N, Phelan J, Hofer G, Köhler C, Kratzer B, Tauber PA, Xiao X, Yao N, Dvorak J, Caraballo L, Puerta L, Rosskopf S, Chakir J, Malle E, Lane AP, Pickl WF, Lajoie S, Wills-Karp M. (2020). Serum amyloid A is a soluble pattern recognition receptor that drives type 2 immunity. https://doi.org/10.1038/s41590-020-0698-1
86
Roversi P, Lissina O, Johnson S, Ahmat N, Paesen GC, Ploss K, Boland W, Nunn MA, Lea SM. (2007). The Structure of OMCI, a Novel Lipocalin Inhibitor of the Complement System. https://doi.org/10.1016/j.jmb.2007.03.064
87
Denisov SS, Dijkgraaf I. (2021). Immunomodulatory Proteins in Tick Saliva From a Structural Perspective. https://doi.org/10.3389/fcimb.2021.769574
88
Ochieng J, Chaudhuri G. (2010). Cystatin Superfamily. https://doi.org/10.1353/hpu.0.0257
89
Lalmanach G, Naudin C, Lecaille F, Fritz H. (2010). Kininogens: More than cysteine protease inhibitors and kinin precursors. https://doi.org/10.1016/j.biochi.2010.03.011
90
Pal D, Dasgupta S, Kundu R, Maitra S, Das G, Mukhopadhyay S, Ray S, Majumdar SS, Bhattacharya S. (2012). Fetuin-A acts as an endogenous ligand of TLR4 to promote lipid-induced insulin resistance. https://doi.org/10.1038/nm.2851
91
Demetriou M, Binkert C, Sukhu B, Tenenbaum HC, Dennis JW. (1996). Fetuin/α2-HS Glycoprotein Is a Transforming Growth Factor-β Type II Receptor Mimic and Cytokine Antagonist. https://doi.org/10.1074/jbc.271.22.12755
92
Gao X, Tian Y, Liu Z-l, Li D, Liu J-j, Yu G-x, Duan D-y, Peng T, Cheng T-y, Liu L. (2024). Tick salivary protein Cystatin: structure, anti-inflammation and molecular mechanism. https://doi.org/10.1016/j.ttbdis.2023.102289
93
Chmelař J, Kotál J, Langhansová H, Kotsyfakis M. (2017). Protease Inhibitors in Tick Saliva: The Role of Serpins and Cystatins in Tick-host-Pathogen Interaction. https://doi.org/10.3389/fcimb.2017.00216
94
Chung K, Pitcher T, Grant AD, Hewitt E, Lindstrom E, Malcangio M. (2019). Cathepsin S acts via protease-activated receptor 2 to activate sensory neurons and induce itch-like behaviour. https://doi.org/10.1016/j.ynpai.2019.100032
95
Montague-Cardoso K, Malcangio M. (2020). Cathepsin S as a potential therapeutic target for chronic pain. https://doi.org/10.1016/j.medidd.2020.100047
96
Conus S, Simon H-U. (2008). Cathepsins: Key modulators of cell death and inflammatory responses. https://doi.org/10.1016/j.bcp.2008.07.041
97
Siming GAO , Honglin ZHU , Xiaoxia ZUO , Hui LUO . (2018). Cathepsin G and Its Role in Inflammation and Autoimmune Diseases. https://doi.org/10.5606/archrheumatol.2018.6595
98
Shafee TMA, Lay FT, Phan TK, Anderson MA, Hulett MD. (2016). Convergent evolution of defensin sequence, structure and function. https://doi.org/10.1007/s00018-016-2344-5
99
Wu J, Zhou X, Chen Q, Chen Z, Zhang J, Yang L, Sun Y, Wang G, Dai J, Feng T. (2022). Defensins as a promising class of tick antimicrobial peptides: a scoping review. https://doi.org/10.1186/s40249-022-00996-8
100
Zhang Z, Zhu S. (2009). Drosomycin, an essential component of antifungal defence in Drosophila. https://doi.org/10.1111/j.1365-2583.2009.00907.x
101
Li X, Yang H, Han Y, Yin S, Shen B, Wu Y, Li W, Cao Z. (2021). Tick peptides evoke itch by activating MrgprC11/MRGPRX1 to sensitize TRPV1 in pruriceptors. https://doi.org/10.1016/j.jaci.2020.12.626
102
Schraiber JG, Edge MD, Pennell M. (2024). Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. https://doi.org/10.1101/2024.02.10.579721
103
David KT, Schraiber JG, Crandall JG, LaBella AL, Opulente DA, Harrison M-C, Wolters JF, Zhou X, Shen X-X, Groenewald M, Hittinger CT, Pennell M, Rokas A. (2024). Convergent expansions of keystone gene families drive metabolic innovation in a major eukaryotic clade. https://doi.org/10.1101/2024.07.22.604484
Log inorsign upto leave a comment.

Loading comments …