Summary: CRISPR associated protein Cas2
Pfam includes annotations and additional family information from a range of different sources. These sources can be accessed via the tabs below.
This is the Wikipedia entry entitled "CRISPR". More...
The Wikipedia text that you see displayed here is a download from Wikipedia. This means that the information we display is a copy of the information from the Wikipedia database. The button next to the article title ("Edit Wikipedia article") takes you to the edit page for the article directly within Wikipedia. You should be aware you are not editing our local copy of this information. Any changes that you make to the Wikipedia article will not be displayed here until we next download the article from Wikipedia. We currently download new content on a nightly basis.
Does Pfam agree with the content of the Wikipedia entry ?
Pfam has chosen to link families to Wikipedia articles. In some case we have created or edited these articles but in many other cases we have not made any direct contribution to the content of the article. The Wikipedia community does monitor edits to try to ensure that (a) the quality of article annotation increases, and (b) vandalism is very quickly dealt with. However, we would like to emphasise that Pfam does not curate the Wikipedia entries and we cannot guarantee the accuracy of the information on the Wikipedia page.
Editing Wikipedia articles
Before you edit for the first time
Wikipedia is a free, online encyclopedia. Although anyone can edit or contribute to an article, Wikipedia has some strong editing guidelines and policies, which promote the Wikipedia standard of style and etiquette. Your edits and contributions are more likely to be accepted (and remain) if they are in accordance with this policy.
You should take a few minutes to view the following pages:
How your contribution will be recorded
Anyone can edit a Wikipedia entry. You can do this either as a new user or you can register with Wikipedia and log on. When you click on the "Edit Wikipedia article" button, your browser will direct you to the edit page for this entry in Wikipedia. If you are a registered user and currently logged in, your changes will be recorded under your Wikipedia user name. However, if you are not a registered user or are not logged on, your changes will be logged under your computer's IP address. This has two main implications. Firstly, as a registered Wikipedia user your edits are more likely seen as valuable contribution (although all edits are open to community scrutiny regardless). Secondly, if you edit under an IP address you may be sharing this IP address with other users. If your IP address has previously been blocked (due to being flagged as a source of 'vandalism') your edits will also be blocked. You can find more information on this and creating a user account at Wikipedia.
If you have problems editing a particular page, contact us at firstname.lastname@example.org and we will try to help.
The community annotation is a new facility of the Pfam web site. If you have problems editing or experience problems with these pages please contact us.
CRISPR Edit Wikipedia article
This article relies too much on references to primary sources. (July 2018) (Learn how and when to remove this template message)
|Cascade (CRISPR-associated complex for antiviral defense)|
CRISPR Cascade protein (cyan) bound to CRISPR RNA (green) and phage DNA (red)
CRISPR (//) (clustered regularly interspaced short palindromic repeats) is a family of DNA sequences found within the genomes of prokaryotic organisms such as bacteria and archaea. These sequences are derived from DNA fragments of bacteriophages that have previously infected the prokaryote and are used to detect and destroy DNA from similar phages during subsequent infections. Hence these sequences play a key role in the antiviral (i.e. anti-phage) defense system of prokaryotes.
Cas9 (or "CRISPR-associated protein 9") is an enzyme that uses CRISPR sequences as a guide to recognize and cleave specific strands of DNA that are complementary to the CRISPR sequence. Cas9 enzymes together with CRISPR sequences form the basis of a technology known as CRISPR-Cas9 that can be used to edit genes within organisms. This editing process has a wide variety of applications including basic biological research, development of biotechnology products, and treatment of diseases.
The CRISPR-Cas system is a prokaryotic immune system that confers resistance to foreign genetic elements such as those present within plasmids and phages that provides a form of acquired immunity. RNA harboring the spacer sequence helps Cas (CRISPR-associated) proteins recognize and cut foreign pathogenic DNA. Other RNA-guided Cas proteins cut foreign RNA. CRISPR are found in approximately 50% of sequenced bacterial genomes and nearly 90% of sequenced archaea.
- 1 History
- 2 Locus structure
- 3 Mechanism
- 4 Evolution
- 5 Identification
- 6 Use by phages
- 7 Applications
- 8 See also
- 9 References
- 10 Further reading
- 11 External links
The discovery of clustered DNA repeats occurred independently in three parts of the world. The first description of what would later be called CRISPR is from Osaka University researcher Yoshizumi Ishino and his colleagues in 1987. They accidentally cloned part of a CRISPR sequence together with the "iap" gene (isozyme conversion of alkaline phosphatase) that was their target. The organization of the repeats was unusual. Repeated sequences are typically arranged consecutively, without interspersed different sequences. They did not know the function of the interrupted clustered repeats.
In 1993, researchers of Mycobacterium tuberculosis in the Netherlands published two articles about a cluster of interrupted direct repeats (DR) in that bacterium. They recognized the diversity of the sequences that intervened the direct repeats among different strains of M. tuberculosis and used this property to design a typing method that was named spoligotyping, which is still in use today.
At the same time, repeats were observed in the archaeal organisms of Haloferax and Haloarcula species, and their function was studied by Francisco Mojica at the University of Alicante in Spain. Although his hypothesis turned out to be wrong, Mojica's supervisor surmised at the time that the clustered repeats had a role in correctly segregating replicated DNA into daughter cells during cell division because plasmids and chromosomes with identical repeat arrays could not coexist in Haloferax volcanii. Transcription of the interrupted repeats was also noted for the first time. By 2000, Mojica performed a survey of scientific literature and one of his students performed a search in published genomes with a program devised by himself. They identified interrupted repeats in 20 species of microbes as belonging to the same family. In 2001, Mojica and Ruud Jansen, who were searching for additional interrupted repeats, proposed the acronym CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) to alleviate the confusion stemming from the numerous acronyms used to describe the sequences in the scientific literature. In 2002, Tang, et al. showed evidence that CRISPR repeat regions from the genome of Archaeoglobus fulgidus were transcribed into long RNA molecules that were subsequently processed into unit-length small RNAs, plus some longer forms of 2, 3, or more spacer-repeat units.
A major addition to the understanding of CRISPR came with Jansen's observation that the prokaryote repeat cluster was accompanied by a set of homologous genes that make up CRISPR-associated systems or cas genes. Four cas genes (cas 1â€“4) were initially recognized. The Cas proteins showed helicase and nuclease motifs, suggesting a role in the dynamic structure of the CRISPR loci. In this publication the acronym CRISPR was used as the universal name of this pattern. However, the CRISPR function remained enigmatic.
In 2005, three independent research groups showed that some CRISPR spacers are derived from phage DNA and extrachromosomal DNA such as plasmids. In effect, the spacers are fragments of DNA gathered from viruses that previously tried to attack the cell. The source of the spacers was a sign that the CRISPR/cas system could have a role in adaptive immunity in bacteria. All three studies proposing this idea were initially rejected by high-profile journals, but eventually appeared in other journals.
The first publication proposing a role of CRISPR-Cas in microbial immunity, by the researchers at the University of Alicante, predicted a role for the RNA transcript of spacers on target recognition in a mechanism that could be analogous to the RNA interference system used by eukaryotic cells. Koonin and colleagues extended this RNA interference hypothesis by proposing mechanisms of action for the different CRISPR-Cas subtypes according to the predicted function of their proteins.
Experimental work by several groups revealed the basic mechanisms of CRISPR-Cas immunity. In 2007, the first experimental evidence that CRISPR was an adaptive immune system was published. A CRISPR region in Streptococcus thermophilus acquired spacers from the DNA of an infecting bacteriophage. The researchers manipulated the resistance of S. thermophilus to different types of phage by adding and deleting spacers whose sequence matched those found in the tested phages. In 2008, Brouns and Van der Oost identified a complex of Cas proteins (called Cascade) that in E. coli cut the CRISPR RNA precursor within the repeats into mature spacer-containing RNA molecules called CRISPR RNA (crRNA), which remained bound to the protein complex. Moreover, it was found that Cascade, crRNA and a helicase/nuclease (Cas3) were required to provide a bacterial host with immunity against infection by a DNA virus. By designing an anti-virus CRISPR, they demonstrated that two orientations of the crRNA (sense/antisense) provided immunity, indicating that the crRNA guides were targeting dsDNA. That year Marraffini and Sontheimer confirmed that a CRISPR sequence of S. epidermidis targeted DNA and not RNA to prevent conjugation. This finding was at odds with the proposed RNA-interference-like mechanism of CRISPR-Cas immunity, although a CRISPR-Cas system that targets foreign RNA was later found in Pyrococcus furiosus. A 2010 study showed that CRISPR-Cas cuts both strands of phage and plasmid DNA in S. thermophilus.
Researchers studied a simpler CRISPR system from Streptococcus pyogenes that relies on the protein Cas9. The Cas9 endonuclease is a four-component system that includes two small crRNA molecules and trans-activating CRISPR RNA (tracrRNA). Jennifer Doudna and Emmanuelle Charpentier re-engineered the Cas9 endonuclease into a more manageable two-component system by fusing the two RNA molecules into a "single-guide RNA" that, when combined with Cas9, could find and cut the DNA target specified by the guide RNA. By manipulating the nucleotide sequence of the guide RNA, the artificial Cas9 system could be programmed to target any DNA sequence for cleavage. Another group of collaborators comprising Virginijus Å ikÅ¡nys together with GasiÅ«nas, Barrangou and Horvath showed that Cas9 from the S. thermophilus CRISPR system can also be reprogrammed to target a site of their choosing by changing the sequence of its crRNA. These advances fueled efforts to edit genomes with the modified CRISPR-Cas9 system.
Groups led by Feng Zhang and George Church simultaneously published descriptions of genome editing in human cell cultures using CRISPR-Cas9 for the first time. It has since been used in a wide range of organisms, including baker's yeast (Saccharomyces cerevisiae), the opportunistic pathogen Candida albicans, zebrafish (Danio rerio), fruit flies (Drosophila melanogaster), ants (Harpegnathos saltator and Ooceraea biroi), mosquitoes (Aedes aegypti), nematodes (Caenorhabditis elegans), plants, mice, monkeys and human embryos.
The CRISPR-Cas9 system has shown to make effective gene edits in Human tripronuclear zygotes first described in a 2015 paper by Chinese scientists P. Liang and Y. Xu. The system made a successful cleavage of mutant Beta-Hemoglobin (HBB) in 28 out of 54 embryos. 4 out of the 28 embryos were successfully recombined using a donor template given by the scientists. The scientists showed that during DNA recombination of the cleaved strand, the homologous endogenous sequence HBD competes with the exogenous donor template. DNA repair in human embryos is much more complicated and particular than in derived stem cells.
Cas12a (formerly Cpf1)
In 2015, the nuclease Cas12a (formerly known as Cpf1) was characterized in the CRISPR/Cpf1 system of the bacterium Francisella novicida. Its original name, from a TIGRFAMs protein family definition built in 2012, reflects the prevalence of its CRISPR-Cas subtype in the Prevotella and Francisella lineages. Cas12a showed several key differences from Cas9 including: causing a 'staggered' cut in double stranded DNA as opposed to the 'blunt' cut produced by Cas9, relying on a 'T rich' PAM (providing alternative targeting sites to Cas9) and requiring only a CRISPR RNA (crRNA) for successful targeting. By contrast Cas9 requires both crRNA and a transactivating crRNA (tracrRNA).
These differences may give Cas12a some advantages over Cas9. For example, Cas12a's small crRNAs are ideal for multiplexed genome editing, as more of them can be packaged in one vector than can Cas9's sgRNAs. As well, the sticky 5â€² overhangs left by Cas12a can be used for DNA assembly that is much more target-specific than traditional Restriction Enzyme cloning. Finally, Cas12a cleaves DNA 18â€“23 base pairs downstream from the PAM site. This means there is no disruption to the recognition sequence after repair, and so Cas12a enables multiple rounds of DNA cleavage. By contrast, since Cas9 cuts only 3 base pairs upstream of the PAM site, the NHEJ pathway results in indel mutations which destroy the recognition sequence, thereby preventing further rounds of cutting. In theory, repeated rounds of DNA cleavage should cause an increased opportunity for the desired genomic editing to occur.
Repeats and spacers
The CRISPR array is made up of an AT-rich leader sequence followed by short repeats that are separated by unique spacers. CRISPR repeats typically range in size from 28 to 37 base pairs (bps), though there can be as few as 23 bp and as many as 55 bp. Some show dyad symmetry, implying the formation of a secondary structure such as a stem-loop ('hairpin') in the RNA, while others are designed to be unstructured. The size of spacers in different CRISPR arrays is typically 32 to 38 bp (range 21 to 72 bp). New spacers can appear rapidly as part of the immune response to phage infection. There are usually fewer than 50 units of the repeat-spacer sequence in a CRISPR array.
CRISPR RNA structures
Cas genes and CRISPR subtypes
Small clusters of cas genes are often located next to CRISPR repeat-spacer arrays. Collectively the 93 cas genes are grouped into 35 families based on sequence similarity of the encoded proteins. 11 of the 35 families form the cas core, which includes the protein families Cas1 through Cas9. A complete CRISPR-Cas locus has at least one gene belonging to the cas core.
CRISPR-Cas systems fall into two classes. Class 1 systems use a complex of multiple Cas proteins to degrade foreign nucleic acids. Class 2 systems use a single large Cas protein for the same purpose. Class 1 is divided into types I, III, and IV; class 2 is divided into types II, V, and VI. The 6 system types are divided into 19 subtypes. Each type and most subtypes are characterized by a "signature gene" found almost exclusively in the category. Classification is also based on the complement of cas genes that are present. Most CRISPR-Cas systems have a Cas1 protein. The phylogeny of Cas1 proteins generally agrees with the classification system. Many organisms contain multiple CRISPR-Cas systems suggesting that they are compatible and may share components. The sporadic distribution of the CRISPR/Cas subtypes suggests that the CRISPR/Cas system is subject to horizontal gene transfer during microbial evolution.
|Class||Cas type||Signature protein||Function||Reference|
|1||I||Cas3||Single-stranded DNA nuclease (HD domain) and ATP-dependent helicase|||
|IA||Cas8a, Cas5||Subunit of the interference module. Important in targeting of invading DNA by recognizing the PAM sequence|||
|ID||Cas10d||contains a domain homologous to the palm domain of nucleic acid polymerases and nucleotide cyclases|||
|IF||Csy1, Csy2, Csy3||Not determined|||
|III||Cas10||Homolog of Cas10d and Cse1|||
|IIIC||Cas10 or Csx11|||
|2||II||Cas9||Nucleases RuvC and HNH together produce DSBs, and separately can produce single-strand breaks. Ensures the acquisition of functional spacers during adaptation.|||
|IIA||Csn2||Ring-shaped DNA-binding protein. Involved in primed adaptation in Type II CRISPR system.|||
|IIC||Characterized by the absence of either Csn2 or Cas4|||
|V||Cpf1, C2c1, C2c3||Nuclease RuvC. Lacks HNH.|||
|VI||Cas13a (previously known as C2c2), Cas13b, Cas13c, Cas13d||RNA-guided RNase|||
CRISPR-Cas immunity is a natural process of bacteria and archaea. CRISPR-Cas prevents bacteriophage infection, conjugation and natural transformation by degrading foreign nucleic acids that enter the cell.
When a microbe is invaded by a bacteriophage, the first stage of the immune response is to capture phage DNA and insert it into a CRISPR locus in the form of a spacer. Cas1 and Cas2 are found in both types of CRISPR-Cas immune systems, which indicates that they are involved in spacer acquisition. Mutation studies confirmed this hypothesis, showing that removal of cas1 or cas2 stopped spacer acquisition, without affecting CRISPR immune response.
Multiple Cas1 proteins have been characterised and their structures resolved. Cas1 proteins have diverse amino acid sequences. However, their crystal structures are similar and all purified Cas1 proteins are metal-dependent nucleases/integrases that bind to DNA in a sequence-independent manner. Representative Cas2 proteins have been characterised and possess either (single strand) ssRNA- or (double strand) dsDNA- specific endoribonuclease activity.
In the I-E system of E. coli Cas1 and Cas2 form a complex where a Cas2 dimer bridges two Cas1 dimers. In this complex Cas2 performs a non-enzymatic scaffolding role, binding double-stranded fragments of invading DNA, while Cas1 binds the single-stranded flanks of the DNA and catalyses their integration into CRISPR arrays. New spacers are usually added at the beginning of the CRISPR next to the leader sequence creating a chronological record of viral infections. In E. coli a histone like protein called integration host factor (IHF), which binds to the leader sequence, is responsible for the accuracy of this integration. IHF also enhances integration efficiency in the type I-F system of Pectobacterium atrosepticum. but in other systems different host factors may be required
Protospacer adjacent motifs
Bioinformatic analysis of regions of phage genomes that were excised as spacers (termed protospacers) revealed that they were not randomly selected but instead were found adjacent to short (3â€“5 bp) DNA sequences termed protospacer adjacent motifs (PAM). Analysis of CRISPR-Cas systems showed PAMs to be important for type I and type II, but not type III systems during acquisition. In type I and type II systems, protospacers are excised at positions adjacent to a PAM sequence, with the other end of the spacer cut using a ruler mechanism, thus maintaining the regularity of the spacer size in the CRISPR array. The conservation of the PAM sequence differs between CRISPR-Cas systems and appears to be evolutionarily linked to Cas1 and the leader sequence.
New spacers are added to a CRISPR array in a directional manner, occurring preferentially, but not exclusively, adjacent to the leader sequence. Analysis of the type I-E system from E. coli demonstrated that the first direct repeat adjacent to the leader sequence, is copied, with the newly acquired spacer inserted between the first and second direct repeats.
The PAM sequence appears to be important during spacer insertion in type I-E systems. That sequence contains a strongly conserved final nucleotide (nt) adjacent to the first nt of the protospacer. This nt becomes the final base in the first direct repeat. This suggests that the spacer acquisition machinery generates single-stranded overhangs in the second-to-last position of the direct repeat and in the PAM during spacer insertion. However, not all CRISPR-Cas systems appear to share this mechanism as PAMs in other organisms do not show the same level of conservation in the final position. It is likely that in those systems, a blunt end is generated at the very end of the direct repeat and the protospacer during acquisition.
Analysis of Sulfolobus solfataricus CRISPRs revealed further complexities to the canonical model of spacer insertion, as one of its six CRISPR loci inserted new spacers randomly throughout its CRISPR array, as opposed to inserting closest to the leader sequence.
Multiple CRISPRs contain many spacers to the same phage. The mechanism that causes this phenomenon was discovered in the type I-E system of E. coli. A significant enhancement in spacer acquisition was detected where spacers already target the phage, even mismatches to the protospacer. This â€˜primingâ€™ requires the Cas proteins involved in both acquisition and interference to interact with each other. Newly acquired spacers that result from the priming mechanism are always found on the same strand as the priming spacer. This observation led to the hypothesis that the acquisition machinery slides along the foreign DNA after priming to find a new protospacer.
CRISPR-RNA (crRNA), which later guides the Cas nuclease to the target during the interference step, must be generated from the CRISPR sequence. The crRNA is initially transcribed as part of a single long transcript encompassing much of the CRISPR array. This transcript is then cleaved by Cas proteins to form crRNAs. The mechanism to produce crRNAs differs among CRISPR/Cas systems. In type I-E and type I-F systems, the proteins Cas6e and Cas6f respectively, recognise stem-loops created by the pairing of identical repeats that flank the crRNA. These Cas proteins cleave the longer transcript at the edge of the paired region, leaving a single crRNA along with a small remnant of the paired repeat region.
Type III systems also use Cas6, however their repeats do not produce stem-loops. Cleavage instead occurs by the longer transcript wrapping around the Cas6 to allow cleavage just upstream of the repeat sequence.
Type II systems lack the Cas6 gene and instead utilize RNaseIII for cleavage. Functional type II systems encode an extra small RNA that is complementary to the repeat sequence, known as a trans-activating crRNA (tracrRNA). Transcription of the tracrRNA and the primary CRISPR transcript results in base pairing and the formation of dsRNA at the repeat sequence, which is subsequently targeted by RNaseIII to produce crRNAs. Unlike the other two systems the crRNA does not contain the full spacer, which is instead truncated at one end.
CrRNAs associate with Cas proteins to form ribonucleotide complexes that recognize foreign nucleic acids. CrRNAs show no preference between the coding and non-coding strands, which is indicative of an RNA-guided DNA-targeting system. The type I-E complex (commonly referred to as Cascade) requires five Cas proteins bound to a single crRNA.
During the interference stage in type I systems the PAM sequence is recognized on the crRNA-complementary strand and is required along with crRNA annealing. In type I systems correct base pairing between the crRNA and the protospacer signals a conformational change in Cascade that recruits Cas3 for DNA degradation.
Type II systems rely on a single multifunctional protein, Cas9, for the interference step. Cas9 requires both the crRNA and the tracrRNA to function and cleaves DNA using its dual HNH and RuvC/RNaseH-like endonuclease domains. Basepairing between the PAM and the phage genome is required in type II systems. However, the PAM is recognized on the same strand as the crRNA (the opposite strand to type I systems).
Type III systems, like type I require six or seven Cas proteins binding to crRNAs. The type III systems analysed from S. solfataricus and P. furiosus both target the mRNA of phages rather than phage DNA genome, which may make these systems uniquely capable of targeting RNA-based phage genomes. Type III systems were also found to target DNA in addition to RNA using a different Cas protein in the complex, Cas10. The DNA cleavage was shown to be transcription dependent.
The mechanism for distinguishing self from foreign DNA during interference is built into the crRNAs and is therefore likely common to all three systems. Throughout the distinctive maturation process of each major type, all crRNAs contain a spacer sequence and some portion of the repeat at one or both ends. It is the partial repeat sequence that prevents the CRISPR-Cas system from targeting the chromosome as base pairing beyond the spacer sequence signals self and prevents DNA cleavage. RNA-guided CRISPR enzymes are classified as type V restriction enzymes.
|CRISPR associated protein|
crystal structure of a crispr-associated protein from Thermus thermophilus
|CRISPR associated protein Cas2|
crystal structure of a hypothetical protein tt1823 from Thermus thermophilus
|CRISPR-associated protein Cse1|
The cas genes in the adaptor and effector modules of the CRISPR-Cas system are believed to have evolved from two different ancestral modules. A transposon-like element called casposon encoding the Cas1-like integrase and potentially other components of the adaptation module was inserted next to the ancestral effector module, which likely functioned as an independent innate immune system. The highly conserved cas1 and cas2 genes of the adaptor module evolved from the ancestral module while a variety of class 1 effector was genes evolved from the ancestral effector module. The evolution of these various class 1 effector module cas genes was guided by various mechanisms, such as duplication events. On the other hand, each type of class 2 effector module arose from subsequent independent insertions of mobile genetic elements. These mobile genetic elements took the place of the multiple gene effector modules to create single gene effector modules that produce large proteins which perform all the necessary tasks of the effector module. The spacer regions of CRISPR-Cas systems are taken directly from foreign mobile genetic elements and thus their long term evolution is hard to trace. The non-random evolution of these spacer regions has been found to be highly dependent on the environment and the particular foreign mobile genetic elements it contains.
CRISPR/Cas can immunize bacteria against certain phages and thus halt transmission. For this reason, Koonin described CRISPR/Cas as a Lamarckian inheritance mechanism. However, this was disputed by a critic who noted, "We should remember [Lamarck] for the good he contributed to science, not for things that resemble his theory only superficially. Indeed, thinking of CRISPR and other phenomena as Lamarckian only obscures the simple and elegant way evolution really works". As more recent studies have been conducted, it has become apparent that the acquired spacer regions of CRISPR-Cas systems are a form of Lamarckian evolution because they are genetic mutations that are acquired and then passed on. On the other hand, the evolution of the Cas gene machinery that facilitates the system evolves through classic Darwinian evolution.
Analysis of CRISPR sequences revealed coevolution of host and viral genomes. Cas9 proteins are highly enriched in pathogenic and commensal bacteria. CRISPR/Cas-mediated gene regulation may contribute to the regulation of endogenous bacterial genes, particularly during interaction with eukaryotic hosts. For example, Francisella novicida uses a unique, small, CRISPR/Cas-associated RNA (scaRNA) to repress an endogenous transcript encoding a bacterial lipoprotein that is critical for F. novicida to dampen host response and promote virulence.
The basic model of CRISPR evolution is newly incorporated spacers driving phages to mutate their genomes to avoid the bacterial immune response, creating diversity in both the phage and host populations. To resist a phage infection, the sequence of the CRISPR spacer must correspond perfectly to the sequence of the target phage gene. Phages can continue to infect their hosts given point mutations in the spacer. Similar stringency is required in PAM or the bacterial strain remains phage sensitive.
A study of 124 S. thermophilus strains showed that 26% of all spacers were unique and that different CRISPR loci showed different rates of spacer acquisition. Some CRISPR loci evolve more rapidly than others, which allowed the strains' phylogenetic relationships to be determined. A comparative genomic analysis showed that E. coli and S. enterica evolve much more slowly than S. thermophilus. The latter's strains that diverged 250 thousand years ago still contained the same spacer complement.
Metagenomic analysis of two acid-mine-drainage biofilms showed that one of the analyzed CRISPRs contained extensive deletions and spacer additions versus the other biofilm, suggesting a higher phage activity/prevalence in one community than the other. In the oral cavity, a temporal study determined that 7â€“22% of spacers were shared over 17 months within an individual while less than 2% were shared across individuals.
From the same environment a single strain was tracked using PCR primers specific to its CRISPR system. Broad-level results of spacer presence/absence showed significant diversity. However, this CRISPR added 3 spacers over 17 months, suggesting that even in an environment with significant CRISPR diversity some loci evolve slowly.
CRISPRs were analysed from the metagenomes produced for the human microbiome project. Although most were body-site specific, some within a body site are widely shared among individuals. One of these loci originated from streptococcal species and contained â‰ˆ15,000 spacers, 50% of which were unique. Similar to the targeted studies of the oral cavity, some showed little evolution over time.
CRISPR evolution was studied in chemostats using S. thermophilus to directly examine spacer acquisition rates. In one week, S. thermophilus strains acquired up to three spacers when challenged with a single phage. During the same interval the phage developed single nucleotide polymorphisms that became fixed in the population, suggesting that targeting had prevented phage replication absent these mutations.
Another S. thermophilus experiment showed that phages can infect and replicate in hosts that have only one targeting spacer. Yet another showed that sensitive hosts can exist in environments with high phage titres. The chemostat and observational studies suggest many nuances to CRISPR and phage (co)evolution.
CRISPRs are widely distributed among bacteria and archaea and show some sequence similarities. Their most notable characteristic is their repeating spacers and direct repeats. This characteristic makes CRISPRs easily identifiable in long sequences of DNA, since the number of repeats decreases the likelihood of a false positive match.
Analysis of CRISPRs in metagenomic data is more challenging, as CRISPR loci do not typically assemble, due to their repetitive nature or through strain variation, which confuses assembly algorithms. Where many reference genomes are available, polymerase chain reaction (PCR) can be used to amplify CRISPR arrays and analyse spacer content. However, this approach yields information only for specifically targeted CRISPRs and for organisms with sufficient representation in public databases to design reliable polymerase chain reaction (PCR) primers. Degenerate repeat-specific primers can be used to amplify CRISPR spacers directly from environmental samples; amplicons containing two or three spacers can be then computationally assembled to reconstruct long CRISPR arrays.
The alternative is to extract and reconstruct CRISPR arrays from shotgun metagenomic data. This is computationally more difficult, particularly with second generation sequencing technologies (e.g. 454, Illumina), as the short read lengths prevent more than two or three repeat units appearing in a single read. CRISPR identification in raw reads has been achieved using purely de novo identification or by using direct repeat sequences in partially assembled CRISPR arrays from contigs (overlapping DNA segments that together represent a consensus region of DNA) and direct repeat sequences from published genomes as a hook for identifying direct repeats in individual reads.
Use by phages
Another way for bacteria to defend against phage infection is by having chromosomal islands. A subtype of chromosomal islands called phage-inducible chromosomal island (PICI) is excised from a bacterial chromosome upon phage infection and can inhibit phage replication. PICIs are induced, excised, replicated and finally packaged into small capsids by certain staphylococcal temperate phages. PICIs use several mechanisms to block phage reproduction. In first mechanism PICI-encoded Ppi differentially blocks phage maturation by binding or interacting specifically with phage TerS, hence blocks phage TerS/TerL complex formation responsible for phage DNA packaging. In second mechanism PICI CpmAB redirect the phage capsid morphogenetic protein to make 95% of SaPI-sized capsid and phage DNA can package only 1/3rd of their genome in these small capsid and hence become nonviable phage. The third mechanism involves two proteins, PtiA and PtiB, that target the LtrC, which is responsible for the production of virion and lysis proteins. This interference mechanism is modulated by a modulatory protein, PtiM, binds to one of the interference-mediating proteins, PtiA, and hence achieving the required level of interference.
One study showed that lytic ICP1 phage, which specifically targets Vibrio cholerae serogroup O1, has acquired a CRISPR/Cas system that targets a V. cholera PICI-like element. The system has 2 CRISPR loci and 9 Cas genes. It seems to be homologous to the I-F system found in Yersinia pestis. Moreover, like the bacterial CRISPR/Cas system, ICP1 CRISPR/Cas can acquire new sequences, which allows phage and host to co-evolve.
Certain archaeal viruses were shown to carry mini-CRISPR arrays containing one or two spacers. It has been shown that spacers within the virus-borne CRISPR arrays target other viruses and plasmids, suggesting that mini-CRISPR arrays represent a mechanism of heterotypic superinfection exclusion and participate in interviral conflicts.
CRISPR gene editing
By the end of 2014 some 1000 research papers had been published that mentioned CRISPR. The technology had been used to functionally inactivate genes in human cell lines and cells, to study Candida albicans, to modify yeasts used to make biofuels and to genetically modify crop strains. CRISPR can also be used to change mosquitos so they cannot transmit diseases such as malaria. CRISPR based approaches utilizing Cas12a have recently been utilized in the successful modification of a broad number of plant species.
In July 2019, doctors in Tennessee, United States, used CRISPR to experimentally treat a patient with a genetic disorder. The patient was a 34-year-old woman with sickle cell disease.
In the future, CRISPR gene editing could be used to create new species or revive extinct species from closely related ones.
CRISPR-based re-evaluations of claims for gene-disease relationships have led to the discovery of potentially important anomalies.
CRISPR as diagnostic tool
CRISPR associated nucleases have shown to be useful as a tool for molecular testing due to their ability to specifically target nucleic acid sequences in a high background of non-target sequences. In 2016, the Cas9 nuclease was used to deplete unwanted nucleotide sequences in next-generation sequencing libraries while requiring only 250 pg of initial RNA input. Beginning in 2017, CRISPR associated nucleases were also used for direct diagnostic testing, down to single molecule sensitivity. In 2019, some new diagnostic applications using CRISPR technology, an electrical CRISPR-Chip for the detection of genetic mutations and a microfluidic CRISPR-Biosensor for electrochemical miRNA diagnostics, are introduced.
- Mulepati S, HÃ©roux A, Bailey S (2014). "Crystal structure of a CRISPR RNAâ€“guided surveillance complex bound to a ssDNA target". Science. 345 (6203): 1479â€“1484. Bibcode:2014Sci...345.1479M. doi:10.1126/science.1256996. PMC 4427192. PMID 25123481.
- Barrangou R (2015). "The roles of CRISPR-Cas systems in adaptive immunity and beyond". Current Opinion in Immunology. 32: 36â€“41. doi:10.1016/j.coi.2014.12.008. PMID 25574773.
- Zhang F, Wen Y, Guo X (2014). "CRISPR/Cas9 for genome editing: progress, implications and challenges". Human Molecular Genetics. 23 (R1): R40â€“6. doi:10.1093/hmg/ddu125. PMID 24651067.
- CRISPR-CAS9, TALENS and ZFNS - the battle in gene editing https://www.ptglab.com/news/blog/crispr-cas9-talens-and-zfns-the-battle-in-gene-editing/
- Hsu PD, Lander ES, Zhang F (June 2014). "Development and applications of CRISPR-Cas9 for genome engineering". Cell. 157 (6): 1262â€“1278. doi:10.1016/j.cell.2014.05.010. PMC 4343198. PMID 24906146.
- Horvath P, Barrangou R (January 2010). "CRISPR/Cas, the immune system of bacteria and archaea". Science. 327 (5962): 167â€“170. Bibcode:2010Sci...327..167H. doi:10.1126/science.1179555. PMID 20056882.
- Redman M, King A, Watson C, King D (August 2016). "What is CRISPR/Cas9?". Archives of Disease in Childhood. Education and Practice Edition. 101 (4): 213â€“215. doi:10.1136/archdischild-2016-310459. PMC 4975809. PMID 27059283.
- Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, Moineau S, et al. (March 2007). "CRISPR provides acquired resistance against viruses in prokaryotes". Science. 315 (5819): 1709â€“1712. Bibcode:2007Sci...315.1709B. doi:10.1126/science.1138140. PMID 17379808. (registration required)
- Marraffini LA, Sontheimer EJ (December 2008). "CRISPR interference limits horizontal gene transfer in staphylococci by targeting DNA". Science. 322 (5909): 1843â€“1845. Bibcode:2008Sci...322.1843M. doi:10.1126/science.1165771. PMC 2695655. PMID 19095942.
- Mohanraju P, Makarova KS, Zetsche B, Zhang F, Koonin EV, van der Oost J (2016). "Diverse evolutionary roots and mechanistic variations of the CRISPR-Cas systems" (PDF). Science. 353 (6299): aad5147. doi:10.1126/science.aad5147. hdl:1721.1/113195. PMID 27493190.
- Hille F, Richter H, Wong SP, BratoviÄ M, Ressel S, Charpentier E (March 2018). "The Biology of CRISPR-Cas: Backward and Forward". Cell. 172 (6): 1239â€“1259. doi:10.1016/j.cell.2017.11.032. hdl:21.11116/0000-0003-FC0D-4. PMID 29522745.
- Ishino Y, Shinagawa H, Makino K, Amemura M, Nakata A (December 1987). "Nucleotide sequence of the iap gene, responsible for alkaline phosphatase isozyme conversion in Escherichia coli, and identification of the gene product". Journal of Bacteriology. 169 (12): 5429â€“5433. doi:10.1128/jb.169.12.5429-5433.1987. PMC 213968. PMID 3316184.
- Ishino Y, Shinagawa H, Makino K, Amemura M, Nakata A (December 1987). "Nucleotide sequence of the iap gene, responsible for alkaline phosphatase isozyme conversion in Escherichia coli, and identification of the gene product". Journal of Bacteriology. 169 (12): 5429â€“5433. doi:10.1128/jb.169.12.5429-5433.1987. PMC 213968. PMID 3316184.
- van Soolingen D, de Haas PE, Hermans PW, Groenen PM, van Embden JD (August 1993). "Comparison of various repetitive DNA elements as genetic markers for strain differentiation and epidemiology of Mycobacterium tuberculosis". Journal of Clinical Microbiology. 31 (8): 1987â€“1995. PMC 265684. PMID 7690367.
- Groenen PM, Bunschoten AE, van Soolingen D, van Embden JD (December 1993). "Nature of DNA polymorphism in the direct repeat cluster of Mycobacterium tuberculosis; application for strain differentiation by a novel typing method". Molecular Microbiology. 10 (5): 1057â€“1065. doi:10.1111/j.1365-2958.1993.tb00976.x. PMID 7934856.
- Mojica FJ, Montoliu L (2016). "On the Origin of CRISPR-Cas Technology: From Prokaryotes to Mammals". Trends in Microbiology. 24 (10): 811â€“820. doi:10.1016/j.tim.2016.06.005. PMID 27401123.
- Mojica FJ, Rodriguez-Valera F (2016). "The discovery of CRISPR in archaea and bacteria" (PDF). The FEBS Journal. 283 (17): 3162â€“3169. doi:10.1111/febs.13766. hdl:10045/57676. PMID 27234458.
- Mojica FJ, DÃez-VillaseÃ±or C, Soria E, Juez G (April 2000). "Biological significance of a family of regularly spaced repeats in the genomes of Archaea, Bacteria and mitochondria". Molecular Microbiology. 36 (1): 244â€“246. doi:10.1046/j.1365-2958.2000.01838.x. PMID 10760181.
- Barrangou R, van der Oost J (2013). CRISPR-Cas Systems : RNA-mediated Adaptive Immunity in Bacteria and Archaea. Heidelberg: Springer. p. 6. ISBN 978-3-642-34656-9.
- Charpentier E, Richter H, van der Oost J, White MF (May 2015). "Biogenesis pathways of RNA guides in archaeal and bacterial CRISPR-Cas adaptive immunity". FEMS Microbiology Reviews. 39 (3): 428â€“441. doi:10.1093/femsre/fuv023. PMC 5965381. PMID 25994611.
- Jansen R, Embden JD, Gaastra W, Schouls LM (March 2002). "Identification of genes that are associated with DNA repeats in prokaryotes". Molecular Microbiology. 43 (6): 1565â€“1575. doi:10.1046/j.1365-2958.2002.02839.x. PMID 11952905.
- Horvath P, Barrangou R (January 2010). "CRISPR/Cas, the immune system of bacteria and archaea". Science. 327 (5962): 167â€“170. Bibcode:2010Sci...327..167H. doi:10.1126/Science.1179555. PMID 20056882.
- Marraffini LA, Sontheimer EJ (March 2010). "CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea". Nature Reviews Genetics. 11 (3): 181â€“190. doi:10.1038/nrg2749. PMC 2928866. PMID 20125085.
- Grissa I, Vergnaud G, Pourcel C (May 2007). "The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats". BMC Bioinformatics. 8: 172. doi:10.1186/1471-2105-8-172. PMC 1892036. PMID 17521438.
- Pourcel C, Salvignol G, Vergnaud G (March 2005). "CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies". Microbiology. 151 (Pt 3): 653â€“663. doi:10.1099/mic.0.27437-0. PMID 15758212.
- Mojica FJ, DÃez-VillaseÃ±or C, GarcÃa-MartÃnez J, Soria E (February 2005). "Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements". Journal of Molecular Evolution. 60 (2): 174â€“182. Bibcode:2005JMolE..60..174M. doi:10.1007/s00239-004-0046-3. PMID 15791728.
- Bolotin A, Quinquis B, Sorokin A, Ehrlich SD (August 2005). "Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin". Microbiology. 151 (Pt 8): 2551â€“2561. doi:10.1099/mic.0.28048-0. PMID 16079334.
- Morange M (June 2015). "What history tells us XXXVII. CRISPR-Cas: The discovery of an immune system in prokaryotes". Journal of Biosciences. 40 (2): 221â€“223. doi:10.1007/s12038-015-9532-6. PMID 25963251.
- Lander ES (January 2016). "The Heroes of CRISPR". Cell. 164 (1â€“2): 18â€“28. doi:10.1016/j.cell.2015.12.041. PMID 26771483.
- Makarova KS, Grishin NV, Shabalina SA, Wolf YI, Koonin EV (March 2006). "A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action". Biology Direct. 1: 7. doi:10.1186/1745-6150-1-7. PMC 1462988. PMID 16545108.
- Marraffini LA (October 2015). "CRISPR-Cas immunity in prokaryotes". Nature. 526 (7571): 55â€“61. Bibcode:2015Natur.526...55M. doi:10.1038/nature15386. PMID 26432244.
- Pennisi E (August 2013). "The CRISPR craze". News Focus. Science. 341 (6148): 833â€“836. Bibcode:2013Sci...341..833P. doi:10.1126/science.341.6148.833. PMID 23970676.
- Brouns SJ, Jore MM, Lundgren M, Westra ER, Slijkhuis RJ, Snijders AP, Dickman MJ, Makarova KS, Koonin EV, van der Oost J (August 2008). "Small CRISPR RNAs guide antiviral defense in prokaryotes". Science. 321 (5891): 960â€“964. Bibcode:2008Sci...321..960B. doi:10.1126/science.1159689. PMC 5898235. PMID 18703739.
- Garneau JE, Dupuis MÃˆ, Villion M, Romero DA, Barrangou R, Boyaval P, et al. (November 2010). "The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA". Nature. 468 (7320): 67â€“71. Bibcode:2010Natur.468...67G. CiteSeerX 10.1.1.451.9645. doi:10.1038/nature09523. PMID 21048762.
- Deltcheva E, Chylinski K, Sharma CM, Gonzales K, Chao Y, Pirzada ZA, Eckert MR, Vogel J, Charpentier E (March 2011). "CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III". Nature. 471 (7340): 602â€“607. Bibcode:2011Natur.471..602D. doi:10.1038/nature09886. PMC 3070239. PMID 21455174.
- Barrangou R (November 2015). "Diversity of CRISPR-Cas immune systems and molecular machines". Genome Biology. 16: 247. doi:10.1186/s13059-015-0816-9. PMC 4638107. PMID 26549499.
- Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E (August 2012). "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity". Science. 337 (6096): 816â€“821. Bibcode:2012Sci...337..816J. doi:10.1126/science.1225829. PMC 6286148. PMID 22745249.
- Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, Zhang F (February 2013). "Multiplex genome engineering using CRISPR/Cas systems". Science. 339 (6121): 819â€“823. Bibcode:2013Sci...339..819C. doi:10.1126/science.1231143. PMC 3795411. PMID 23287718.
- Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, Norville JE, Church GM (February 2013). "RNA-guided human genome engineering via Cas9". Science. 339 (6121): 823â€“826. Bibcode:2013Sci...339..823M. doi:10.1126/science.1232033. PMC 3712628. PMID 23287722.
- DiCarlo JE, Norville JE, Mali P, Rios X, Aach J, Church GM (April 2013). "Genome engineering in Saccharomyces cerevisiae using CRISPR-Cas systems". Nucleic Acids Research. 41 (7): 4336â€“4343. doi:10.1093/nar/gkt135. PMC 3627607. PMID 23460208.
- Zhang GC, Kong II, Kim H, Liu JJ, Cate JH, Jin YS (December 2014). "Construction of a quadruple auxotrophic mutant of an industrial polyploid saccharomyces cerevisiae strain by using RNA-guided Cas9 nuclease". Applied and Environmental Microbiology. 80 (24): 7694â€“7701. doi:10.1128/AEM.02310-14. PMC 4249234. PMID 25281382.
- Liu JJ, Kong II, Zhang GC, Jayakody LN, Kim H, Xia PF, Kwak S, Sung BH, Sohn JH, Walukiewicz HE, Rao CV, Jin YS (April 2016). "Metabolic Engineering of Probiotic Saccharomyces boulardii". Applied and Environmental Microbiology. 82 (8): 2280â€“2287. doi:10.1128/AEM.00057-16. PMC 4959471. PMID 26850302.
- Vyas VK, Barrasa MI, Fink GR (2015). "Candida albicans CRISPR system permits genetic engineering of essential genes and gene families". Science Advances. 1 (3): e1500248. Bibcode:2015SciA....1E0248V. doi:10.1126/sciadv.1500248. PMC 4428347. PMID 25977940.
- Ng H, Dean N (2017). "Candida albicans by Increased Single Guide RNA Expression". mSphere. 2 (2): e00385â€“16. doi:10.1128/mSphere.00385-16. PMC 5397569. PMID 28435892.
- Hwang WY, Fu Y, Reyon D, Maeder ML, Tsai SQ, Sander JD, Peterson RT, Yeh JR, Joung JK (March 2013). "Efficient genome editing in zebrafish using a CRISPR-Cas system". Nature Biotechnology. 31 (3): 227â€“229. doi:10.1038/nbt.2501. PMC 3686313. PMID 23360964.
- Gratz SJ, Cummings AM, Nguyen JN, Hamm DC, Donohue LK, Harrison MM, Wildonger J, O'Connor-Giles KM (August 2013). "Genome engineering of Drosophila with the CRISPR RNA-guided Cas9 nuclease". Genetics. 194 (4): 1029â€“1035. doi:10.1534/genetics.113.152710. PMC 3730909. PMID 23709638.
- Bassett AR, Tibbit C, Ponting CP, Liu JL (July 2013). "Highly efficient targeted mutagenesis of Drosophila with the CRISPR/Cas9 system". Cell Reports. 4 (1): 220â€“228. doi:10.1016/j.celrep.2013.06.020. PMC 3714591. PMID 23827738.
- Yan H, Opachaloemphan C, Mancini G, Yang H, Gallitto M, Mlejnek J, Leibholz A, Haight K, Ghaninia M, Huo L, Perry M, Slone J, Zhou X, Traficante M, Penick CA, Dolezal K, Gokhale K, Stevens K, Fetter-Pruneda I, Bonasio R, Zwiebel LJ, Berger SL, Liebig J, Reinberg D, Desplan C (August 2017). "An Engineered orco Mutation Produces Aberrant Social Behavior and Defective Neural Development in Ants". Cell. 170 (4): 736â€“747.e9. doi:10.1016/j.cell.2017.06.051. PMC 5587193. PMID 28802043.
- Trible W, Olivos-Cisneros L, McKenzie SK, Saragosti J, Chang NC, Matthews BJ, Oxley PR, Kronauer DJ (August 2017). "orco Mutagenesis Causes Loss of Antennal Lobe Glomeruli and Impaired Social Behavior in Ants". Cell. 170 (4): 727â€“735.e10. doi:10.1016/j.cell.2017.07.001. PMC 5556950. PMID 28802042.
- Kistler KE, Vosshall LB, Matthews BJ (April 2015). "Genome engineering with CRISPR-Cas9 in the mosquito Aedes aegypti". Cell Reports. 11 (1): 51â€“60. doi:10.1016/j.celrep.2015.03.009. PMC 4394034. PMID 25818303.
- Friedland AE, Tzur YB, Esvelt KM, ColaiÃ¡covo MP, Church GM, Calarco JA (August 2013). "Heritable genome editing in C. elegans via a CRISPR-Cas9 system". Nature Methods. 10 (8): 741â€“743. doi:10.1038/nmeth.2532. PMC 3822328. PMID 23817069.
- Jiang W, Zhou H, Bi H, Fromm M, Yang B, Weeks DP (November 2013). "Demonstration of CRISPR/Cas9/sgRNA-mediated targeted gene modification in Arabidopsis, tobacco, sorghum and rice". Nucleic Acids Research. 41 (20): e188. doi:10.1093/nar/gkt780. PMC 3814374. PMID 23999092.
- Wang H, Yang H, Shivalila CS, Dawlaty MM, Cheng AW, Zhang F, Jaenisch R (May 2013). "One-step generation of mice carrying mutations in multiple genes by CRISPR/Cas-mediated genome engineering". Cell. 153 (4): 910â€“918. doi:10.1016/j.cell.2013.04.025. PMC 3969854. PMID 23643243.
- Soni D, Wang DM, Regmi SC, Mittal M, Vogel SM, SchlÃ¼ter D, Tiruppathi C (May 2018). "Deubiquitinase function of A20 maintains and repairs endothelial barrier after lung vascular injury". Cell Death Discovery. 4 (60): 60. doi:10.1038/s41420-018-0056-3. PMC 5955943. PMID 29796309.
- Guo X, Li XJ (July 2015). "Targeted genome editing in primate embryos". Cell Research. 25 (7): 767â€“768. doi:10.1038/cr.2015.64. PMC 4493275. PMID 26032266.
- Baltimore D, Berg P, Botchan M, Carroll D, Charo RA, Church G, Corn JE, Daley GQ, Doudna JA, Fenner M, Greely HT, Jinek M, Martin GS, Penhoet E, Puck J, Sternberg SH, Weissman JS, Yamamoto KR (April 2015). "Biotechnology. A prudent path forward for genomic engineering and germline gene modification". Science. 348 (6230): 36â€“38. Bibcode:2015Sci...348...36B. doi:10.1126/science.aab1028. PMC 4394183. PMID 25791083.
- Larson MH, Gilbert LA, Wang X, Lim WA, Weissman JS, Qi LS (November 2013). "CRISPR interference (CRISPRi) for sequence-specific control of gene expression". Nature Protocols. 8 (11): 2180â€“2196. doi:10.1038/nprot.2013.132. PMC 3922765. PMID 24136345.
- Liang P, Xu Y, Zhang X, Ding C, Huang R, Zhang Z, et al. (May 2015). "CRISPR/Cas9-mediated gene editing in human tripronuclear zygotes". Protein & Cell. 6 (5): 363â€“372. doi:10.1007/s13238-015-0153-5. PMC 4417674. PMID 25894090.
- Yan MY, Yan HQ, Ren GX, Zhao JP, Guo XP, Sun YC (September 2017). "CRISPR-Cas12a-Assisted Recombineering in Bacteria". Applied and Environmental Microbiology. 83 (17). doi:10.1128/AEM.00947-17. PMC 5561284. PMID 28646112.
- Zetsche B, Gootenberg JS, Abudayyeh OO, Slaymaker IM, Makarova KS, Essletzbichler P, Volz SE, Joung J, van der Oost J, Regev A, Koonin EV, Zhang F (October 2015). "Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system". Cell. 163 (3): 759â€“771. doi:10.1016/j.cell.2015.09.038. PMC 4638220. PMID 26422227.
- Fonfara I, Richter H, BratoviÄ M, Le Rhun A, Charpentier E (April 2016). "The CRISPR-associated DNA-cleaving enzyme Cpf1 also processes precursor CRISPR RNA". Nature. 532 (7600): 517â€“521. Bibcode:2016Natur.532..517F. doi:10.1038/nature17945. PMID 27096362.
- Kim H, Kim ST, Ryu J, Kang BC, Kim JS, and Kim SG (February 2017). "CRISPR/Cpf1-mediated DNA-free plant genome editing". Nature Communications. 8 (14406): 14406. Bibcode:2017NatCo...814406K. doi:10.1038/ncomms14406. PMC 5316869. PMID 28205546.
- "Cpf1 Nuclease". abmgood.com. Retrieved 2017-12-14.
- Hille F, Charpentier E (November 2016). "CRISPR-Cas: biology, mechanisms and relevance". Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 371 (1707): 20150496. doi:10.1098/rstb.2015.0496. PMC 5052741. PMID 27672148.
- Barrangou R, Marraffini LA (April 2014). "CRISPR-Cas systems: Prokaryotes upgrade to adaptive immunity". Molecular Cell. 54 (2): 234â€“244. doi:10.1016/j.molcel.2014.03.011. PMC 4025954. PMID 24766887.
- Tyson GW, Banfield JF (January 2008). "Rapidly evolving CRISPRs implicated in acquired resistance of microorganisms to viruses". Environmental Microbiology. 10 (1): 200â€“207. doi:10.1111/j.1462-2920.2007.01444.x. PMID 17894817.
- Makarova KS, Wolf YI, Alkhnbashi OS, Costa F, Shah SA, Saunders SJ, et al. (November 2015). "An updated evolutionary classification of CRISPR-Cas systems". Nature Reviews. Microbiology. 13 (11): 722â€“736. doi:10.1038/nrmicro3569. PMC 5426118. PMID 26411297.
- Wright AV, NuÃ±ez JK, Doudna JA (January 2016). "Biology and Applications of CRISPR Systems: Harnessing Nature's Toolbox for Genome Engineering". Cell. 164 (1â€“2): 29â€“44. doi:10.1016/j.cell.2015.12.035. PMID 26771484.
- Westra ER, Dowling AJ, Broniewski JM, van Houte S (November 2016). "Evolution and Ecology of CRISPR". Annual Review of Ecology, Evolution, and Systematics. 47 (1): 307â€“331. doi:10.1146/annurev-ecolsys-121415-032428.
- Wiedenheft B, Sternberg SH, Doudna JA (February 2012). "RNA-guided genetic silencing systems in bacteria and archaea". Nature. 482 (7385): 331â€“338. Bibcode:2012Natur.482..331W. doi:10.1038/nature10886. PMID 22337052.
- Deng L, Garrett RA, Shah SA, Peng X, She Q (March 2013). "A novel interference mechanism by a type IIIB CRISPR-Cmr module in Sulfolobus". Molecular Microbiology. 87 (5): 1088â€“1099. doi:10.1111/mmi.12152. PMID 23320564.
- Sinkunas T, Gasiunas G, Fremaux C, Barrangou R, Horvath P, Siksnys V (April 2011). "Cas3 is a single-stranded DNA nuclease and ATP-dependent helicase in the CRISPR/Cas immune system". The EMBO Journal. 30 (7): 1335â€“1342. doi:10.1038/emboj.2011.41. PMC 3094125. PMID 21343909.
- Huo Y, Nam KH, Ding F, Lee H, Wu L, Xiao Y, Farchione MD, Zhou S, Rajashankar K, Kurinov I, Zhang R, Ke A (September 2014). "Structures of CRISPR Cas3 offer mechanistic insights into Cascade-activated DNA unwinding and degradation". Nature Structural & Molecular Biology. 21 (9): 771â€“777. doi:10.1038/nsmb.2875. PMC 4156918. PMID 25132177.
- Chylinski K, Makarova KS, Charpentier E, Koonin EV (June 2014). "Classification and evolution of type II CRISPR-Cas systems". Nucleic Acids Research. 42 (10): 6091â€“6105. doi:10.1093/nar/gku241. PMC 4041416. PMID 24728998.
- Makarova KS, Aravind L, Wolf YI, Koonin EV (July 2011). "Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems". Biology Direct. 6: 38. doi:10.1186/1745-6150-6-38. PMC 3150331. PMID 21756346.
- Gasiunas G, Barrangou R, Horvath P, Siksnys V (September 2012). "Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria". Proceedings of the National Academy of Sciences of the United States of America. 109 (39): E2579â€“2586. Bibcode:2012PNAS..109E2579G. doi:10.1073/pnas.1208507109. PMC 3465414. PMID 22949671.
- Heler R, Samai P, Modell JW, Weiner C, Goldberg GW, Bikard D, Marraffini LA (March 2015). "Cas9 specifies functional viral targets during CRISPR-Cas adaptation". Nature. 519 (7542): 199â€“202. Bibcode:2015Natur.519..199H. doi:10.1038/nature14245. PMC 4385744. PMID 25707807.
- Nam KH, Kurinov I, Ke A (September 2011). "Crystal structure of clustered regularly interspaced short palindromic repeats (CRISPR)-associated Csn2 protein revealed Ca2+-dependent double-stranded DNA binding activity". The Journal of Biological Chemistry. 286 (35): 30759â€“30768. doi:10.1074/jbc.M111.256263. PMC 3162437. PMID 21697083.
- Chylinski K, Le Rhun A, Charpentier E (May 2013). "The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems". RNA Biology. 10 (5): 726â€“737. doi:10.4161/rna.24321. PMC 3737331. PMID 23563642.
- Cox DB, Gootenberg JS, Abudayyeh OO, Franklin B, Kellner MJ, Joung J, Zhang F (November 2017). "RNA editing with CRISPR-Cas13". Science. 358 (6366): 1019â€“1027. Bibcode:2017Sci...358.1019C. doi:10.1126/science.aaq0180. PMC 5793859. PMID 29070703.
- Aliyari R, Ding SW (January 2009). "RNA-based viral immunity initiated by the Dicer family of host immune receptors". Immunological Reviews. 227 (1): 176â€“188. doi:10.1111/j.1600-065X.2008.00722.x. PMC 2676720. PMID 19120484.
- Dugar G, Herbig A, FÃ¶rstner KU, Heidrich N, Reinhardt R, Nieselt K, Sharma CM (May 2013). "High-resolution transcriptome maps reveal strain-specific regulatory features of multiple Campylobacter jejuni isolates". PLoS Genetics. 9 (5): e1003495. doi:10.1371/journal.pgen.1003495. PMC 3656092. PMID 23696746.
- Hatoum-Aslan A, Maniv I, Marraffini LA (December 2011). "Mature clustered, regularly interspaced, short palindromic repeats RNA (crRNA) length is measured by a ruler mechanism anchored at the precursor processing site". Proceedings of the National Academy of Sciences of the United States of America. 108 (52): 21218â€“21222. Bibcode:2011PNAS..10821218H. doi:10.1073/pnas.1112832108. PMC 3248500. PMID 22160698.
- Yosef I, Goren MG, Qimron U (July 2012). "Proteins and DNA elements essential for the CRISPR adaptation process in Escherichia coli". Nucleic Acids Research. 40 (12): 5569â€“5576. doi:10.1093/nar/gks216. PMC 3384332. PMID 22402487.
- Swarts DC, Mosterd C, van Passel MW, Brouns SJ (2012). "CRISPR interference directs strand specific spacer acquisition". PLOS ONE. 7 (4): e35888. Bibcode:2012PLoSO...735888S. doi:10.1371/journal.pone.0035888. PMC 3338789. PMID 22558257.
- Babu M, Beloglazova N, Flick R, Graham C, Skarina T, Nocek B, et al. (January 2011). "A dual function of the CRISPR-Cas system in bacterial antivirus immunity and DNA repair". Molecular Microbiology. 79 (2): 484â€“502. doi:10.1111/j.1365-2958.2010.07465.x. PMC 3071548. PMID 21219465.
- Han D, Lehmann K, Krauss G (June 2009). "SSO1450â€”a CAS1 protein from Sulfolobus solfataricus P2 with high affinity for RNA and DNA". FEBS Letters. 583 (12): 1928â€“1932. doi:10.1016/j.febslet.2009.04.047. PMID 19427858.
- Wiedenheft B, Zhou K, Jinek M, Coyle SM, Ma W, Doudna JA (June 2009). "Structural basis for DNase activity of a conserved protein implicated in CRISPR-mediated genome defense". Structure. 17 (6): 904â€“912. doi:10.1016/j.str.2009.03.019. PMID 19523907.
- Beloglazova N, Brown G, Zimmerman MD, Proudfoot M, Makarova KS, Kudritska M, et al. (July 2008). "A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats". The Journal of Biological Chemistry. 283 (29): 20361â€“20371. doi:10.1074/jbc.M803225200. PMC 2459268. PMID 18482976.
- Samai P, Smith P, Shuman S (December 2010). "Structure of a CRISPR-associated protein Cas2 from Desulfovibrio vulgaris". Acta Crystallographica Section F. 66 (Pt 12): 1552â€“1556. doi:10.1107/S1744309110039801. PMC 2998353. PMID 21139194.
- Nam KH, Ding F, Haitjema C, Huang Q, DeLisa MP, Ke A (October 2012). "Double-stranded endonuclease activity in Bacillus halodurans clustered regularly interspaced short palindromic repeats (CRISPR)-associated Cas2 protein". The Journal of Biological Chemistry. 287 (43): 35943â€“35952. doi:10.1074/jbc.M112.382598. PMC 3476262. PMID 22942283.
- NuÃ±ez JK, Kranzusch PJ, Noeske J, Wright AV, Davies CW, Doudna JA (June 2014). "Cas1-Cas2 complex formation mediates spacer acquisition during CRISPR-Cas adaptive immunity". Nature Structural & Molecular Biology. 21 (6): 528â€“534. doi:10.1038/nsmb.2820. PMC 4075942. PMID 24793649.
- NuÃ±ez JK, Lee AS, Engelman A, Doudna JA (March 2015). "Integrase-mediated spacer acquisition during CRISPR-Cas adaptive immunity". Nature. 519 (7542): 193â€“198. Bibcode:2015Natur.519..193N. doi:10.1038/nature14237. PMC 4359072. PMID 25707795.
- Wang J, Li J, Zhao H, Sheng G, Wang M, Yin M, Wang Y (November 2015). "Structural and Mechanistic Basis of PAM-Dependent Spacer Acquisition in CRISPR-Cas Systems". Cell. 163 (4): 840â€“853. doi:10.1016/j.cell.2015.10.008. PMID 26478180.
- NuÃ±ez JK, Harrington LB, Kranzusch PJ, Engelman AN, Doudna JA (November 2015). "Foreign DNA capture during CRISPR-Cas adaptive immunity". Nature. 527 (7579): 535â€“538. Bibcode:2015Natur.527..535N. doi:10.1038/nature15760. PMC 4662619. PMID 26503043.
- Sorek R, Lawrence CM, Wiedenheft B (2013). "CRISPR-mediated adaptive immune systems in bacteria and archaea". Annual Review of Biochemistry. 82 (1): 237â€“266. doi:10.1146/annurev-biochem-072911-172315. PMID 23495939.
- NuÃ±ez JK, Bai L, Harrington LB, Hinder TL, Doudna JA (June 2016). "CRISPR Immunological Memory Requires a Host Factor for Specificity". Molecular Cell. 62 (6): 824â€“833. doi:10.1016/j.molcel.2016.04.027. PMID 27211867.
- Fagerlund RD, Wilkinson ME, Klykov O, Barendregt A, Pearce FG, Kieper SN, Maxwell HW, Capolupo A, Heck AJ, Krause KL, Bostina M, Scheltema RA, Staals RH, Fineran PC (June 2017). "Spacer capture and integration by a type I-F Cas1-Cas2â€“3 CRISPR adaptation complex". Proceedings of the National Academy of Sciences of the United States of America. 114 (26): E5122â€“E5128. doi:10.1073/pnas.1618421114. PMC 5495228. PMID 28611213.
- Rollie C, Graham S, Rouillon C, White MF (February 2018). "Prespacer processing and specific integration in a Type I-A CRISPR system". Nucleic Acids Research. 46 (3): 1007â€“1020. doi:10.1093/nar/gkx1232. PMC 5815122. PMID 29228332.
- Horvath P, Romero DA, CoÃ»tÃ©-Monvoisin AC, Richards M, Deveau H, Moineau S, et al. (February 2008). "Diversity, activity, and evolution of CRISPR loci in Streptococcus thermophilus". Journal of Bacteriology. 190 (4): 1401â€“1412. doi:10.1128/JB.01415-07. PMC 2238196. PMID 18065539.
- Deveau H, Barrangou R, Garneau JE, LabontÃ© J, Fremaux C, Boyaval P, Romero DA, Horvath P, Moineau S (February 2008). "Phage response to CRISPR-encoded resistance in Streptococcus thermophilus". Journal of Bacteriology. 190 (4): 1390â€“1400. doi:10.1128/JB.01412-07. PMC 2238228. PMID 18065545.
- Mojica FJ, DÃez-VillaseÃ±or C, GarcÃa-MartÃnez J, Almendros C (March 2009). "Short motif sequences determine the targets of the prokaryotic CRISPR defence system". Microbiology. 155 (Pt 3): 733â€“740. doi:10.1099/mic.0.023960-0. PMID 19246744.
- LillestÃ¸l RK, Shah SA, BrÃ¼gger K, Redder P, Phan H, Christiansen J, Garrett RA (April 2009). "CRISPR families of the crenarchaeal genus Sulfolobus: bidirectional transcription and dynamic properties". Molecular Microbiology. 72 (1): 259â€“272. doi:10.1111/j.1365-2958.2009.06641.x. PMID 19239620.
- Shah SA, Hansen NR, Garrett RA (February 2009). "Distribution of CRISPR spacer matches in viruses and plasmids of crenarchaeal acidothermophiles and implications for their inhibitory mechanism". Biochemical Society Transactions. 37 (Pt 1): 23â€“28. doi:10.1042/BST0370023. PMID 19143596.
- DÃez-VillaseÃ±or C, GuzmÃ¡n NM, Almendros C, GarcÃa-MartÃnez J, Mojica FJ (May 2013). "CRISPR-spacer integration reporter plasmids reveal distinct genuine acquisition specificities among CRISPR-Cas I-E variants of Escherichia coli". RNA Biology. 10 (5): 792â€“802. doi:10.4161/rna.24023. PMC 3737337. PMID 23445770.
- Erdmann S, Garrett RA (September 2012). "Selective and hyperactive uptake of foreign DNA by adaptive immune systems of an archaeon via two distinct mechanisms". Molecular Microbiology. 85 (6): 1044â€“1056. doi:10.1111/j.1365-2958.2012.08171.x. PMC 3468723. PMID 22834906.
- Shah SA, Erdmann S, Mojica FJ, Garrett RA (May 2013). "Protospacer recognition motifs: mixed identities and functional diversity". RNA Biology. 10 (5): 891â€“899. doi:10.4161/rna.23764. PMC 3737346. PMID 23403393.
- Andersson AF, Banfield JF (May 2008). "Virus population dynamics and acquired virus resistance in natural microbial communities". Science. 320 (5879): 1047â€“1050. Bibcode:2008Sci...320.1047A. doi:10.1126/science.1157358. PMID 18497291.
- Pride DT, Sun CL, Salzman J, Rao N, Loomer P, Armitage GC, et al. (January 2011). "Analysis of streptococcal CRISPRs from human saliva reveals substantial sequence diversity within and between subjects over time". Genome Research. 21 (1): 126â€“136. doi:10.1101/gr.111732.110. PMC 3012920. PMID 21149389.
- Goren MG, Yosef I, Auster O, Qimron U (October 2012). "Experimental definition of a clustered regularly interspaced short palindromic duplicon in Escherichia coli". Journal of Molecular Biology. 423 (1): 14â€“16. doi:10.1016/j.jmb.2012.06.037. PMID 22771574.
- Datsenko KA, Pougach K, Tikhonov A, Wanner BL, Severinov K, Semenova E (July 2012). "Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system". Nature Communications. 3: 945. Bibcode:2012NatCo...3..945D. doi:10.1038/ncomms1937. PMID 22781758.
- Gesner EM, Schellenberg MJ, Garside EL, George MM, Macmillan AM (June 2011). "Recognition and maturation of effector RNAs in a CRISPR interference pathway". Nature Structural & Molecular Biology. 18 (6): 688â€“692. doi:10.1038/nsmb.2042. PMID 21572444.
- Sashital DG, Jinek M, Doudna JA (June 2011). "An RNA-induced conformational change required for CRISPR RNA cleavage by the endoribonuclease Cse3". Nature Structural & Molecular Biology. 18 (6): 680â€“687. doi:10.1038/nsmb.2043. PMID 21572442.
- Haurwitz RE, Jinek M, Wiedenheft B, Zhou K, Doudna JA (September 2010). "Sequence- and structure-specific RNA processing by a CRISPR endonuclease". Science. 329 (5997): 1355â€“1358. Bibcode:2010Sci...329.1355H. doi:10.1126/science.1192272. PMC 3133607. PMID 20829488.
- Kunin V, Sorek R, Hugenholtz P (2007). "Evolutionary conservation of sequence and secondary structures in CRISPR repeats". Genome Biology. 8 (4): R61. doi:10.1186/gb-2007-8-4-r61. PMC 1896005. PMID 17442114.
- Carte J, Wang R, Li H, Terns RM, Terns MP (December 2008). "Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes". Genes & Development. 22 (24): 3489â€“3496. doi:10.1101/gad.1742908. PMC 2607076. PMID 19141480.
- Wang R, Preamplume G, Terns MP, Terns RM, Li H (February 2011). "Interaction of the Cas6 riboendonuclease with CRISPR RNAs: recognition and cleavage". Structure. 19 (2): 257â€“264. doi:10.1016/j.str.2010.11.014. PMC 3154685. PMID 21300293.
- Niewoehner O, Jinek M, Doudna JA (January 2014). "Evolution of CRISPR RNA recognition and processing by Cas6 endonucleases". Nucleic Acids Research. 42 (2): 1341â€“1353. doi:10.1093/nar/gkt922. PMC 3902920. PMID 24150936.
- Semenova E, Jore MM, Datsenko KA, Semenova A, Westra ER, Wanner B, et al. (June 2011). "Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence". Proceedings of the National Academy of Sciences of the United States of America. 108 (25): 10098â€“10103. Bibcode:2011PNAS..10810098S. doi:10.1073/pnas.1104144108. PMC 3121866. PMID 21646539.
- Gudbergsdottir S, Deng L, Chen Z, Jensen JV, Jensen LR, She Q, Garrett RA (January 2011). "Dynamic properties of the Sulfolobus CRISPR/Cas and CRISPR/Cmr systems when challenged with vector-borne viral and plasmid genes and protospacers". Molecular Microbiology. 79 (1): 35â€“49. doi:10.1111/j.1365-2958.2010.07452.x. PMC 3025118. PMID 21166892.
- Manica A, Zebec Z, Teichmann D, Schleper C (April 2011). "In vivo activity of CRISPR-mediated virus defence in a hyperthermophilic archaeon". Molecular Microbiology. 80 (2): 481â€“491. doi:10.1111/j.1365-2958.2011.07586.x. PMID 21385233.
- Jore MM, Lundgren M, van Duijn E, Bultema JB, Westra ER, Waghmare SP, et al. (May 2011). "Structural basis for CRISPR RNA-guided DNA recognition by Cascade" (PDF). Nature Structural & Molecular Biology. 18 (5): 529â€“536. doi:10.1038/nsmb.2019. PMID 21460843.
- Wiedenheft B, Lander GC, Zhou K, Jore MM, Brouns SJ, van der Oost J, Doudna JA, Nogales E (September 2011). "Structures of the RNA-guided surveillance complex from a bacterial immune system". Nature. 477 (7365): 486â€“489. Bibcode:2011Natur.477..486W. doi:10.1038/nature10402. PMC 4165517. PMID 21938068.
- Zhang J, Rouillon C, Kerou M, Reeks J, Brugger K, Graham S, Reimann J, Cannone G, Liu H, Albers SV, Naismith JH, Spagnolo L, White MF (February 2012). "Structure and mechanism of the CMR complex for CRISPR-mediated antiviral immunity". Molecular Cell. 45 (3): 303â€“313. doi:10.1016/j.molcel.2011.12.013. PMC 3381847. PMID 22227115.
- Hale CR, Zhao P, Olson S, Duff MO, Graveley BR, Wells L, Terns RM, Terns MP (November 2009). "RNA-guided RNA cleavage by a CRISPR RNA-Cas protein complex". Cell. 139 (5): 945â€“956. doi:10.1016/j.cell.2009.07.040. PMC 2951265. PMID 19945378.
- Estrella MA, Kuo FT, Bailey S (2016). "RNA-activated DNA cleavage by the Type III-B CRISPRâ€“Cas effector complex". Genes and Development. 30 (4): 460â€“470. doi:10.1101/gad.273722.115. PMC 4762430. PMID 26848046.
- Samai P, Pyenson N, Jiang W, Goldberg GW, Hatoum-Aslan A, Marraffini LA (2015). "Co-transcriptional DNA and RNA Cleavage during Type III CRISPR-Cas Immunity". Cell. 161 (5): 1164â€“1174. doi:10.1016/j.cell.2015.04.027. PMC 4594840. PMID 25959775.
- Marraffini LA, Sontheimer EJ (January 2010). "Self versus non-self discrimination during CRISPR RNA-directed immunity". Nature. 463 (7280): 568â€“571. Bibcode:2010Natur.463..568M. doi:10.1038/nature08703. PMC 2813891. PMID 20072129.
- Krupovic M, BÃ©guin P, Koonin EV (August 2017). "Casposons: mobile genetic elements that gave rise to the CRISPR-Cas adaptation machinery". Current Opinion in Microbiology. 38: 36â€“43. doi:10.1016/j.mib.2017.04.004. PMC 5665730. PMID 28472712.
- Koonin EV, Makarova KS (May 2013). "CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes". RNA Biology. 10 (5): 679â€“686. doi:10.4161/rna.24022. PMC 3737325. PMID 23439366.
- Koonin EV, Makarova KS, Zhang F (June 2017). "Diversity, classification and evolution of CRISPR-Cas systems". Current Opinion in Microbiology. 37: 67â€“78. doi:10.1016/j.mib.2017.05.008. PMC 5776717. PMID 28605718.
- Shmakov S, Smargon A, Scott D, Cox D, Pyzocha N, Yan W, Abudayyeh OO, Gootenberg JS, Makarova KS, Wolf YI, Severinov K, Zhang F, Koonin EV (March 2017). "Diversity and evolution of class 2 CRISPR-Cas systems". Nature Reviews. Microbiology. 15 (3): 169â€“182. doi:10.1038/nrmicro.2016.184. PMC 5851899. PMID 28111461.
- Kupczok A, Bollback JP (February 2013). "Probabilistic models for CRISPR spacer content evolution". BMC Evolutionary Biology. 13 (1): 54. doi:10.1186/1471-2148-13-54. PMC 3704272. PMID 23442002.
- Sternberg SH, Richter H, Charpentier E, Qimron U (March 2016). "Adaptation in CRISPR-Cas Systems". Molecular Cell. 61 (6): 797â€“808. doi:10.1016/j.molcel.2016.01.030. hdl:21.11116/0000-0003-E74E-2. PMID 26949040.
- Koonin EV, Wolf YI (November 2009). "Is evolution Darwinian or/and Lamarckian?". Biology Direct. 4: 42. doi:10.1186/1745-6150-4-42. PMC 2781790. PMID 19906303.
- Weiss A (October 2015). "Lamarckian Illusions". Trends in Ecology & Evolution. 30 (10): 566â€“568. doi:10.1016/j.tree.2015.08.003. PMID 26411613.
- Koonin EV, Wolf YI (February 2016). "Just how Lamarckian is CRISPR-Cas immunity: the continuum of evolvability mechanisms". Biology Direct. 11 (1): 9. doi:10.1186/s13062-016-0111-z. PMC 4765028. PMID 26912144.
- Heidelberg JF, Nelson WC, Schoenfeld T, Bhaya D (2009). Ahmed N (ed.). "Germ warfare in a microbial mat community: CRISPRs provide insights into the co-evolution of host and viral genomes". PLOS ONE. 4 (1): e4169. Bibcode:2009PLoSO...4.4169H. doi:10.1371/journal.pone.0004169. PMC 2612747. PMID 19132092.
- Sampson TR, Saroj SD, Llewellyn AC, Tzeng YL, Weiss DS (May 2013). "A CRISPR/Cas system mediates bacterial innate immune evasion and virulence". Nature. 497 (7448): 254â€“257. Bibcode:2013Natur.497..254S. doi:10.1038/nature12048. PMC 3651764. PMID 23584588.
- Touchon M, Rocha EP (June 2010). Randau L (ed.). "The small, slow and specialized CRISPR and anti-CRISPR of Escherichia and Salmonella". PLOS ONE. 5 (6): e11126. Bibcode:2010PLoSO...511126T. doi:10.1371/journal.pone.0011126. PMC 2886076. PMID 20559554.
- Rho M, Wu YW, Tang H, Doak TG, Ye Y (2012). "Diverse CRISPRs evolving in human microbiomes". PLoS Genetics. 8 (6): e1002441. doi:10.1371/journal.pgen.1002441. PMC 3374615. PMID 22719260.
- Sun CL, Barrangou R, Thomas BC, Horvath P, Fremaux C, Banfield JF (February 2013). "Phage mutations in response to CRISPR diversification in a bacterial population". Environmental Microbiology. 15 (2): 463â€“470. doi:10.1111/j.1462-2920.2012.02879.x. PMID 23057534.
- Kuno S, Sako Y, Yoshida T (May 2014). "Diversification of CRISPR within coexisting genotypes in a natural population of the bloom-forming cyanobacterium Microcystis aeruginosa". Microbiology. 160 (Pt 5): 903â€“916. doi:10.1099/mic.0.073494-0. PMID 24586036.
- Sorek R, Kunin V, Hugenholtz P (March 2008). "CRISPRâ€”a widespread system that provides acquired resistance against phages in bacteria and archaea". Nature Reviews. Microbiology. 6 (3): 181â€“186. doi:10.1038/nrmicro1793. PMID 18157154.
Table 1: Web resources for CRISPR analysis
- Pride DT, Salzman J, Relman DA (September 2012). "Comparisons of clustered regularly interspaced short palindromic repeats and viromes in human saliva reveal bacterial adaptations to salivary viruses". Environmental Microbiology. 14 (9): 2564â€“2576. doi:10.1111/j.1462-2920.2012.02775.x. PMC 3424356. PMID 22583485.
- Held NL, Herrera A, Whitaker RJ (November 2013). "Reassortment of CRISPR repeat-spacer loci in Sulfolobus islandicus". Environmental Microbiology. 15 (11): 3065â€“3076. doi:10.1111/1462-2920.12146. PMID 23701169.
- Held NL, Herrera A, Cadillo-Quiroz H, Whitaker RJ (September 2010). "CRISPR associated diversity within a population of Sulfolobus islandicus". PLOS ONE. 5 (9): e12988. Bibcode:2010PLoSO...512988H. doi:10.1371/journal.pone.0012988. PMC 2946923. PMID 20927396.
- Medvedeva, S; Liu, Y; Koonin, EV; Severinov, K; Prangishvili, D; Krupovic, M (2019). "Virus-borne mini-CRISPR arrays are involved in interviral conflicts". Nature Communications. 10 (1): 5204. doi:10.1038/s41467-019-13205-2. PMC 6858448. PMID 31729390.
- Skennerton CT, Imelfort M, Tyson GW (May 2013). "Crass: identification and reconstruction of CRISPR from unassembled metagenomic data". Nucleic Acids Research. 41 (10): e105. doi:10.1093/nar/gkt183. PMC 3664793. PMID 23511966.
- Stern A, Mick E, Tirosh I, Sagy O, Sorek R (October 2012). "CRISPR targeting reveals a reservoir of common phages associated with the human gut microbiome". Genome Research. 22 (10): 1985â€“1994. doi:10.1101/gr.138297.112. PMC 3460193. PMID 22732228.
- Novick RP, Christie GE, PenadÃ©s JR (August 2010). "The phage-related chromosomal islands of Gram-positive bacteria". Nature Reviews Microbiology. 8 (8): 541â€“551. doi:10.1038/nrmicro2393. PMC 3522866. PMID 20634809.
- Ram G, Chen J, Kumar K, Ross HF, Ubeda C, Damle PK, Lane KD, PenadÃ©s JR, Christie GE, Novick RP (October 2012). "Staphylococcal pathogenicity island interference with helper phage reproduction is a paradigm of molecular parasitism". Proceedings of the National Academy of Sciences of the United States of America. 109 (40): 16300â€“16305. Bibcode:2012PNAS..10916300R. doi:10.1073/pnas.1204615109. PMC 3479557. PMID 22991467.
- Ram G, Chen J, Ross HF, Novick RP (October 2014). "Precisely modulated pathogenicity island interference with late phage gene transcription". Proceedings of the National Academy of Sciences of the United States of America. 111 (40): 14536â€“14541. Bibcode:2014PNAS..11114536R. doi:10.1073/pnas.1406749111. PMC 4209980. PMID 25246539.
- Seed KD, Lazinski DW, Calderwood SB, Camilli A (February 2013). "A bacteriophage encodes its own CRISPR/Cas adaptive response to evade host innate immunity". Nature. 494 (7438): 489â€“491. Bibcode:2013Natur.494..489S. doi:10.1038/nature11927. PMC 3587790. PMID 23446421.
- Doudna JA, Charpentier E (November 2014). "Genome editing. The new frontier of genome engineering with CRISPR-Cas9". Science. 346 (6213): 1258096. doi:10.1126/science.1258096. PMID 25430774.
- Ledford H (June 2015). "CRISPR, the disruptor". Nature. 522 (7554): 20â€“24. Bibcode:2015Natur.522...20L. doi:10.1038/522020a. PMID 26040877.
- Alphey L (2016). "Can CRISPR-Cas9 gene drives curb malaria?". Nature Biotechnology. 34 (2): 149â€“150. doi:10.1038/nbt.3473. PMID 26849518.
- BernabÃ©-Orts JM, Casas-Rodrigo I, Minguet EG, Landolfi V, Garcia-Carpintero V, Gianoglio S, et al. (April 2019). "Assessment of Cas12a-mediated gene editing efficiency in plants". Plant Biotechnology Journal. 17 (10): 1971â€“1984. doi:10.1111/pbi.13113. PMC 6737022. PMID 30950179.
- "In A 1st, Doctors In U.S. Use CRISPR Tool To Treat Patient With Genetic Disorder". NPR.org. Retrieved 2019-07-31.
- The-Crispr (2019-07-15). "Listen Radiolab CRISPR podcast". The Crispr. Retrieved 2019-07-15.
- Ledford H (2017). "CRISPR studies muddy results of older gene research". Nature. doi:10.1038/nature.2017.21763.
- Gu W, Crawford ED, O'Donovan BD, Wilson MR, Chow ED, Retallack H, DeRisi JL (March 2016). "Depletion of Abundant Sequences by Hybridization (DASH): using Cas9 to remove unwanted high-abundance species in sequencing libraries and molecular counting applications". Genome Biology. 17 (1): 41. doi:10.1186/s13059-016-0904-5. PMC 4778327. PMID 26944702.
- Gootenberg JS, Abudayyeh OO, Lee JW, Essletzbichler P, Dy AJ, Joung J, et al. (April 2017). "Nucleic acid detection with CRISPR-Cas13a/C2c2". Science. 356 (6336): 438â€“442. Bibcode:2017Sci...356..438G. doi:10.1126/science.aam9321. PMC 5526198. PMID 28408723.
- Chen JS, Ma E, Harrington LB, Da Costa M, Tian X, Palefsky JM, Doudna JA (April 2018). "CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity". Science. 360 (6387): 436â€“439. Bibcode:2018Sci...360..436C. doi:10.1126/science.aar6245. PMC 6628903. PMID 29449511.
- "Sherlock Biosciences". sherlock.bio. Retrieved 2019-11-07.
- "Mammoth Biosciences". Mammoth Biosciences. Retrieved 2019-11-07.
- Hajian, Reza; Balderston, Sarah; Tran, Thanhtra; deBoer, Tara; Etienne, Jessy; Sandhu, Mandeep; Wauford, Noreen A.; Chung, Jing-Yi; Nokes, Jolie; Athaiya, Mitre; Paredes, Jacobo (June 2019). "Detection of unamplified target genes via CRISPRâ€“Cas9 immobilized on a graphene field-effect transistor". Nature Biomedical Engineering. 3 (6): 427â€“437. doi:10.1038/s41551-019-0371-x. ISSN 2157-846X. PMC 6556128. PMID 31097816.
- Bruch, Richard; Baaske, Julia; Chatelle, Claire; Meirich, Mailin; Madlener, Sibylle; Weber, Wilfried; Dincer, Can; Urban, Gerald Anton (2019-10-30). "CRISPR/Cas13aâ€Powered Electrochemical Microfluidic Biosensor for Nucleic Acid Amplificationâ€Free miRNA Diagnostics". Advanced Materials: 1905311. doi:10.1002/adma.201905311. ISSN 0935-9648.
- Doudna J, Mali P (23 March 2016). CRISPR-Cas: A Laboratory Manual. New York: Cold Spring Harbor Laboratory Press. ISBN 978-1-62182-131-1.
- Mohanraju P, Makarova KS, Zetsche B, Zhang F, Koonin EV, van der Oost J (August 2016). "Diverse evolutionary roots and mechanistic variations of the CRISPR-Cas systems" (PDF). Science. 353 (6299): aad5147. doi:10.1126/science.aad5147. hdl:1721.1/113195. PMID 27493190.
- Sander JD, Joung JK (April 2014). "CRISPR-Cas systems for editing, regulating and targeting genomes". Nature Biotechnology. 32 (4): 347â€“355. doi:10.1038/nbt.2842. PMC 4022601. PMID 24584096.
- Slaymaker IM, Gao L, Zetsche B, Scott DA, Yan WX, Zhang F (January 2016). "Rationally engineered Cas9 nucleases with improved specificity". Science. 351 (6268): 84â€“88. Bibcode:2016Sci...351...84S. doi:10.1126/science.aad5227. PMC 4714946. PMID 26628643.
- Terns RM, Terns MP (March 2014). "CRISPR-based technologies: prokaryotic defense weapons repurposed". Trends in Genetics. 30 (3): 111â€“118. doi:10.1016/j.tig.2014.01.003. PMC 3981743. PMID 24555991.
- Westra ER, Buckling A, Fineran PC (May 2014). "CRISPR-Cas systems: beyond adaptive immunity". Nature Reviews Microbiology. 12 (5): 317â€“326. doi:10.1038/nrmicro3241. PMID 24704746.
- Andersson AF, Banfield JF (May 2008). "Virus population dynamics and acquired virus resistance in natural microbial communities". Science. 320 (5879): 1047â€“1050. Bibcode:2008Sci...320.1047A. doi:10.1126/science.1157358. PMID 18497291.
- Hale C, Kleppe K, Terns RM, Terns MP (December 2008). "Prokaryotic silencing (psi)RNAs in Pyrococcus furiosus". RNA. 14 (12): 2572â€“2579. doi:10.1261/rna.1246808. PMC 2590957. PMID 18971321.
- van der Ploeg JR (June 2009). "Analysis of CRISPR in Streptococcus mutans suggests frequent occurrence of acquired immunity against infection by M102-like bacteriophages" (PDF). Microbiology. 155 (Pt 6): 1966â€“1976. doi:10.1099/mic.0.027508-0. PMID 19383692.
- van der Oost J, Brouns SJ (November 2009). "RNAi: prokaryotes get in on the act". Cell. 139 (5): 863â€“865. doi:10.1016/j.cell.2009.11.018. PMID 19945373.
- Karginov FV, Hannon GJ (January 2010). "The CRISPR system: small RNA-guided defense in bacteria and archaea". Molecular Cell. 37 (1): 7â€“19. doi:10.1016/j.molcel.2009.12.033. PMC 2819186. PMID 20129051.
- Pul U, Wurm R, Arslan Z, Geissen R, Hofmann N, Wagner R (March 2010). "Identification and characterization of E. coli CRISPR-cas promoters and their silencing by H-NS". Molecular Microbiology. 75 (6): 1495â€“1512. doi:10.1111/j.1365-2958.2010.07073.x. PMID 20132443.
- DÃez-VillaseÃ±or C, Almendros C, GarcÃa-MartÃnez J, Mojica FJ (May 2010). "Diversity of CRISPR loci in Escherichia coli". Microbiology. 156 (Pt 5): 1351â€“1361. doi:10.1099/mic.0.036046-0. PMID 20133361.
- Deveau H, Garneau JE, Moineau S (2010). "CRISPR/Cas system and its role in phage-bacteria interactions". Annual Review of Microbiology. 64: 475â€“493. doi:10.1146/annurev.micro.112408.134123. PMID 20528693.
- Koonin EV, Makarova KS (December 2009). "CRISPR-Cas: an adaptive immunity system in prokaryotes". F1000 Biology Reports. 1: 95. doi:10.3410/B1-95. PMC 2884157. PMID 20556198.
- "The age of the red pen". The Economist. August 22, 2015. ISSN 0013-0613. Retrieved 2015-08-25.
- Ran AF, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F (2013). "Genome engineering using the CRISPR-Cas9 system". Nature Protocols. 8 (11): 2281â€“2308. doi:10.1038/nprot.2013.143. PMC 3969860. PMID 24157548.
|Scholia has a topic profile for CRISPR.|
- Advanced Gene Editing: CRISPR-Cas9 Congressional Research Service
- Jennifer Doudna talk: Genome Engineering with CRISPR-Cas9: Birth of a Breakthrough Technology
- Overview of all the structural information available in the PDB for UniProt: Q46901 (CRISPR system Cascade subunit CasA) at the PDBe-KB.
- Overview of all the structural information available in the PDB for UniProt: P76632 (CRISPR system Cascade subunit CasB) at the PDBe-KB.
- Overview of all the structural information available in the PDB for UniProt: Q46899 (CRISPR system Cascade subunit CasC) at the PDBe-KB.
- Overview of all the structural information available in the PDB for UniProt: Q46898 (CRISPR system Cascade subunit CasD) at the PDBe-KB.
- Overview of all the structural information available in the PDB for UniProt: Q46897 (CRISPR system Cascade subunit CasE) at the PDBe-KB.
This tab holds the annotation information that is stored in the Pfam database. As we move to using Wikipedia as our main source of annotation, the contents of this tab will be gradually replaced by the Wikipedia tab.
CRISPR associated protein Cas2 Provide feedback
Members of this family of bacterial proteins comprise various hypothetical proteins, as well as CRISPR (clustered regularly interspaced short palindromic repeats) associated proteins, conferring resistance to infection by certain bacteriophages.
Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, Moineau S, Romero DA, Horvath P; , Science. 2007;315:1709-1712.: CRISPR provides acquired resistance against viruses in prokaryotes. PUBMED:17379808 EPMC:17379808
This tab holds annotation information from the InterPro database.
InterPro entry IPR019199
The CRISPR-Cas system is a prokaryotic defense mechanism against foreign genetic elements. The key elements of this defense system are the Cas proteins and the CRISPR RNA.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes [PUBMED:17442114]. CRISPRs appear to provide acquired resistance against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).
The defense reaction is divided into three stages. In the adaptation stage, the invader DNA is cleaved, and a piece of it is selected to be integrated as a new spacer into the CRISPR locus, where it is stored as an identity tag for future attacks by this invader. During the second stage (the expression stage), the CRISPR RNA (pre-crRNA) is transcribed and subsequently processed into the mature crRNAs. In the third stage (the interference stage), Cas proteins, together with crRNAs, identify and degrade the invader [PUBMED:17379808, PUBMED:16545108, PUBMED:21699496].
The CRISPR-Cas systems have been sorted into three major classes. In CRISPR-Cas types I and III, the mature crRNA is generally generated by a member of the Cas6 protein family. Whereas in system III the Cas6 protein acts alone, in some class I systems it is part of a complex of Cas proteins known as Cascade (CRISPR-associated complex for antiviral defense). The Cas6 protein is an endoribonuclease necessary for crRNA production whereas the additional Cas proteins that form the Cascade complex are needed for crRNA stability [PUBMED:24459147].
Members of this family of bacterial proteins comprise various hypothetical proteins, as well as CRISPR (clustered regularly interspaced short palindromic repeats) associated proteins, conferring resistance to infection by certain bacteriophages.
Below is a listing of the unique domain organisations or architectures in which this domain is found. More...
The graphic that is shown by default represents the longest sequence with a given architecture. Each row contains the following information:
- the number of sequences which exhibit this architecture
a textual description of the architecture, e.g. Gla, EGF x 2, Trypsin.
This example describes an architecture with one
Gladomain, followed by two consecutive
EGFdomains, and finally a single
- a link to the page in the Pfam site showing information about the sequence that the graphic describes
- the UniProt description of the protein sequence
- the number of residues in the sequence
- the Pfam graphic itself.
Note that you can see the family page for a particular domain by clicking on the graphic. You can also choose to see all sequences which have a given architecture by clicking on the Show link in each row.
Finally, because some families can be found in a very large number of architectures, we load only the first fifty architectures by default. If you want to see more architectures, click the button at the bottom of the page to load the next set.
Loading domain graphics...
We store a range of different sequence alignments for families. As well as the seed alignment from which the family is built, we provide the full alignment, generated by searching the sequence database (reference proteomes) using the family HMM. We also generate alignments using four representative proteomes (RP) sets, the UniProtKB sequence database, the NCBI sequence database, and our metagenomics sequence database. More...
There are various ways to view or download the sequence alignments that we store. We provide several sequence viewers and a plain-text Stockholm-format file for download.
We make a range of alignments for each Pfam-A family:
- the curated alignment from which the HMM for the family is built
- the alignment generated by searching the sequence database using the HMM
- Representative Proteomes (RPs) at 15%, 35%, 55% and 75% co-membership thresholds
- alignment generated by searching the UniProtKB sequence database using the family HMM
- alignment generated by searching the NCBI sequence database using the family HMM
- alignment generated by searching the metagenomics sequence database using the family HMM
You can see the alignments as HTML or in three different sequence viewers:
- a Java applet developed at the University of Dundee. You will need Java installed before running jalview
- an HTML page showing the whole alignment.Please note: full Pfam alignments can be very large. These HTML views are extremely large and often cause problems for browsers. Please use either jalview or the Pfam viewer if you have trouble viewing the HTML version
- an HTML-based representation of the alignment, coloured according to the posterior-probability (PP) values from the HMM. As for the standard HTML view, heatmap alignments can also be very large and slow to render.
You can download (or view in your browser) a text representation of a Pfam alignment in various formats:
You can also change the order in which sequences are listed in the alignment, change how insertions are represented, alter the characters that are used to represent gaps in sequences and, finally, choose whether to download the alignment or to view it in your browser directly.
You may find that large alignments cause problems for the viewers and the reformatting tool, so we also provide all alignments in Stockholm format. You can download either the plain text alignment, or a gzipped version of it.
We make a range of alignments for each Pfam-A family. You can see a description of each above. You can view these alignments in various ways but please note that some types of alignment are never generated while others may not be available for all families, most commonly because the alignments are too large to handle.
1Cannot generate PP/Heatmap alignments for seeds; no PP data available
Key: available, not generated, — not available.
Format an alignment
We make all of our alignments available in Stockholm format. You can download them here as raw, plain text files or as gzip-compressed files.
You can also download a FASTA format file containing the full-length sequences for all sequences in the full alignment.
HMM logos is one way of visualising profile HMMs. Logos provide a quick overview of the properties of an HMM in a graphical form. You can see a more detailed description of HMM logos and find out how you can interpret them here. More...
If you find these logos useful in your own work, please consider citing the following article:
This page displays the phylogenetic tree for this family's seed alignment. We use FastTree to calculate neighbour join trees with a local bootstrap based on 100 resamples (shown next to the tree nodes). FastTree calculates approximately-maximum-likelihood phylogenetic trees from our seed alignment.
Note: You can also download the data file for the tree.
Curation and family details
This section shows the detailed information about the Pfam family. You can see the definitions of many of the terms in this section in the glossary and a fuller explanation of the scoring system that we use in the scores section of the help pages.
|Seed source:||COGs (COG3512)|
|Author:||COGs, Finn RD , Sammut SJ|
|Number in seed:||414|
|Number in full:||3858|
|Average length of the domain:||74.30 aa|
|Average identity of full alignment:||22 %|
|Average coverage of the sequence by the domain:||66.94 %|
|HMM build commands:||
build method: hmmbuild -o /dev/null HMM SEED
search method: hmmsearch -Z 45638612 -E 1000 --cpu 4 HMM pfamseq
|Family (HMM) version:||9|
|Download:||download the raw HMM for this family|
Weight segments by...
Change the size of the sunburst
selected sequences to HMM
a FASTA-format file
- 0 sequences
- 0 species
This visualisation provides a simple graphical representation of the distribution of this family across species. You can find the original interactive tree in the More....
This chart is a modified "sunburst" visualisation of the species tree for this family. It shows each node in the tree as a separate arc, arranged radially with the superkingdoms at the centre and the species arrayed around the outermost ring.
How the sunburst is generated
The tree is built by considering the taxonomic lineage of each sequence that has a match to this family. For each node in the resulting tree, we draw an arc in the sunburst. The radius of the arc, its distance from the root node at the centre of the sunburst, shows the taxonomic level ("superkingdom", "kingdom", etc). The length of the arc represents either the number of sequences represented at a given level, or the number of species that are found beneath the node in the tree. The weighting scheme can be changed using the sunburst controls.
In order to reduce the complexity of the representation, we reduce the number of taxonomic levels that we show. We consider only the following eight major taxonomic levels:
Colouring and labels
Segments of the tree are coloured approximately according to their superkingdom. For example, archeal branches are coloured with shades of orange, eukaryotes in shades of purple, etc. The colour assignments are shown under the sunburst controls. Where space allows, the name of the taxonomic level will be written on the arc itself.
As you move your mouse across the sunburst, the current node will be highlighted. In the top section of the controls panel we show a summary of the lineage of the currently highlighed node. If you pause over an arc, a tooltip will be shown, giving the name of the taxonomic level in the title and a summary of the number of sequences and species below that node in the tree.
Anomalies in the taxonomy tree
There are some situations that the sunburst tree cannot easily handle and for which we have work-arounds in place.
Missing taxonomic levels
Some species in the taxonomic tree may not have one or more of the main eight levels that we display. For example, Bos taurus is not assigned an order in the NCBI taxonomic tree. In such cases we mark the omitted level with, for example, "No order", in both the tooltip and the lineage summary.
Unmapped species names
The tree is built by looking at each sequence in the full alignment for the family. We take the name of the species given by UniProt and try to map that to the full taxonomic tree from NCBI. In some cases, the name chosen by UniProt does not map to any node in the NCBI tree, perhaps because the chosen name is listed as a synonym or a misspelling in the NCBI taxonomy.
So that these nodes are not simply omitted from the sunburst tree, we group them together in a separate branch (or segment of the sunburst tree). Since we cannot determine the lineage for these unmapped species, we show all levels between the superkingdom and the species as "uncategorised".
Since we reduce the species tree to only the eight main taxonomic levels, sequences that are mapped to the sub-species level in the tree would not normally be shown. Rather than leave out these species, we map them instead to their parent species. So, for example, for sequences belonging to one of the Vibrio cholerae sub-species in the NCBI taxonomy, we show them instead as belonging to the species Vibrio cholerae.
Too many species/sequences
For large species trees, you may see blank regions in the outer layers of the sunburst. These occur when there are large numbers of arcs to be drawn in a small space. If an arc is less than approximately one pixel wide, it will not be drawn and the space will be left blank. You may still be able to get some information about the species in that region by moving your mouse across the area, but since each arc will be very small, it will be difficult to accurately locate a particular species.
The tree shows the occurrence of this domain across different species. More...
We show the species tree in one of two ways. For smaller trees we try to show an interactive representation, which allows you to select specific nodes in the tree and view them as an alignment or as a set of Pfam domain graphics.
Unfortunately we have found that there are problems viewing the interactive tree when the it becomes larger than a certain limit. Furthermore, we have found that Internet Explorer can become unresponsive when viewing some trees, regardless of their size. We therefore show a text representation of the species tree when the size is above a certain limit or if you are using Internet Explorer to view the site.
If you are using IE you can still load the interactive tree by clicking the "Generate interactive tree" button, but please be aware of the potential problems that the interactive species tree can cause.
For all of the domain matches in a full alignment, we count the number that are found on all sequences in the alignment. This total is shown in the purple box.
We also count the number of unique sequences on which each domain is found, which is shown in green. Note that a domain may appear multiple times on the same sequence, leading to the difference between these two numbers.
Finally, we group sequences from the same organism according to the NCBI code that is assigned by UniProt, allowing us to count the number of distinct sequences on which the domain is found. This value is shown in the pink boxes.
We use the NCBI species tree to group organisms according to their taxonomy and this forms the structure of the displayed tree. Note that in some cases the trees are too large (have too many nodes) to allow us to build an interactive tree, but in most cases you can still view the tree in a plain text, non-interactive representation. Those species which are represented in the seed alignment for this domain are highlighted.
You can use the tree controls to manipulate how the interactive tree is displayed:
- show/hide the summary boxes
- highlight species that are represented in the seed alignment
- expand/collapse the tree or expand it to a given depth
- select a sub-tree or a set of species within the tree and view them graphically or as an alignment
- save a plain text representation of the tree
Please note: for large trees this can take some time. While the tree is loading, you can safely switch away from this tab but if you browse away from the family page entirely, the tree will not be loaded.
There are 2 interactions for this family. More...
We determine these interactions using iPfam, which considers the interactions between residues in three-dimensional protein structures and maps those interactions back to Pfam families. You can find more information about the iPfam algorithm in the journal article that accompanies the website.
For those sequences which have a structure in the Protein DataBank, we use the mapping between UniProt, PDB and Pfam coordinate systems from the PDBe group, to allow us to map Pfam domains onto UniProt sequences and three-dimensional protein structures. The table below shows the structures on which the CRISPR_Cas2 domain has been found. There are 35 instances of this domain found in the PDB. Note that there may be multiple copies of the domain in a single PDB structure, since many structures contain multiple copies of the same protein sequence.
Loading structure mapping...