Introducing a third article type – the Protocol Extension

2016 is an important year for Nature Protocols as in the summer we will be celebrating our 10 year anniversary. Both in response to the success of some of our earlier protocols and reflecting the enormous improvements in the methods available to researchers over the past ten years, this month we are launching a new article type, the Protocol Extension.

Our first Protocol Extension is, as the name implies, still in the traditional protocol format. The method described in this article is based on an earlier version of the technique, for which we published a protocol back in 2006 from a different research group. This protocol is widely used by researchers and various changes to methodology have been made over recent years. Several of these adaptations are described in the new Protocol Extension article. Unlike Protocol Update articles, which should be used in place of the protocol they update, Protocol Extension articles provide a substantial modification or additional applications. They complement, rather than replace the earlier protocol, and may be authored by the same, or a different group of researchers. However, they also have some similarities to Protocol Updates, in that the Protocol Extension articles have headers to make their article type clear, they are always at the start of an issue, and there is a statement in the abstract to make it clear the article modifies or offers additional applications to one or more existing protocols.

As method development is an iterative process it is not always clear what constitutes an extension to a protocol. Very few new methods are developed that do not build on an earlier method, thus most of our protocols could be viewed as extending existing methods to a certain extent. In view of this we have decided to reserve this new article type for those articles where a new protocol is clearly warranted, and there is a clear reason to link the new article to an earlier protocol, for example, where sections of the procedure are the same.

As always, if you have any comments and suggestions, we would very much like to hear your views.

SWATH-MS at Nature Protocols and Scientific Data

The lab of Ruedi Aebersold recently published a Nature Protocol for generating peptide libraries for targeted analysis of SWATH MS data-independent acquisition mass spectrometry. This protocol complements a data descriptor (published in Scientific Data by the same authors) for a large-scale human assay library that can be used to support protein quantification by SWATH-MS.

In order to understand an organism’s biology and how it responds to environmental changes (e.g. disease or drug treatments), many researchers would like to be able to correctly identify and even quantify as many of the proteins in any given sample as possible. Mass spectrometry – in particular liquid chromatography-coupled tandem mass spectrometry (LC-MS/MS) – is the analytical technique most commonly used for deep and reliable exploration of the proteome.

In bottom-up proteomics, the cells of the sample are lysed, the proteins are digested into shorter peptides, the non-peptide material is removed, and the peptides separated by liquid chromatography. In LC-MS, the column separating the peptides is interfaced with a mass spectrometer and the peptides are analysed as they elute.

Sometimes the researcher will already know which proteins will be most interesting for their study; in these cases Selected Reaction Monitoring (SRM) is a very sensitive mass spectrometry approach to detecting and consistently quantifying the peptides associated with these proteins over many samples. The reactions monitored are the specific fragmentations associated with peptides-of-interest that occur in the mass spectrometer.

In many cases this information will not yet be fully available before the measurement or the number of potential protein-players in the given system might be very large. Data-independent acquisition of mass spectra for proteomics experiments is an alternative approach that extends the number of proteins that can be targeted in a sample from approximately one hundred (in SRM) to several thousand. It is made possible by the development of mass spectrometry instruments that are able to acquire high-resolutioin mass spectra at a very high sampling rate. The analysis method requires the researcher to perform initial experiments to put together a library of time and mass spectrometric coordinates corresponding to the peptides associated with each sample type. These spectra are then used as the basis for the development of highly specific assays to detect and quantify the respective peptide in subsequent samples.

We recently published a protocol for generating peptide libraries for a type of data independent mass spectrometry developed by Ruedi Aebersold and co-workers called SWATH-MS. The assay libraries are generated by collecting high-quality fragment ion spectra in data dependent acquisition mode, and processing these in a number of computational steps as shown in the workflow below.

Workflow for SWATH assay library generation

 

Scientific Data published a data descriptor for a generic large-scale human assay library to support protein quantification by SWATH-MS. This library was prepared using the procedure described in the Nature Protocol and consists of 1,164,312 transitions identifying 139,449 proteotypic peptides and 10,316 proteins; it was generated by combining the results from 331 measurements of fractions from different cell lines, tissue and affinity enriched protein samples. 

The data itself is deposited in the ProteomeXchange at the following locations:
PXD000953
PXD000954
and at SWATHAtlas

Stem cells in Singapore

At the start of last month I headed to Singapore for the ISSCR Regional Forum “Global Controls in Stem Cells”. I had a great time and found the meeting a big improvement on last year’s Florence meeting, due to the increase in networking opportunities.

There was a lot of epigenetics at the meeting – as you’d expect given the topic focus. Several people showed the classic Waddington depiction of the epigenetic landscape and explained the need to define where in the landscape the various cell types present during human development lie, plus the different varieties of artificially engineered stem cells. It was fascinating to hear about the various different approaches being taken to address this question.

I don’t have the space to discuss every talk, so here is a brief summary of the sessions and some of the talks.

The first keynote of the meeting was from Rudolf Jaenisch. He started by talking about naïve versus primed embryonic stem cells (ESCs) and whether there is a single human naïve state. He then moved on to talk about finding disease-relevant phenotypes, for example using patient-derived induced pluripotent stem cells (iPSCs). He particularly focused on the need for an appropriate control, he favours an isogenic control to control for genetic background.

The first session was focused on reprogramming, with various researchers talking about their attempts to improve reprogramming and to dissect reprogramming pathways. Jonathan Loh talked about the miniaturised reprogramming system using 384 well plates his group has developed to aid this process. Hongkui Deng finished the morning talks with a fantastic talk about reprogramming human fibroblasts to hepatocytes with drug metabolic function. Up until now reprogrammed hepatocytes have lacked this function, which is so important if hepatocytes derived in this way are going to be used for drug screening.

After lunch there was a session on chromatin dynamics. Bing Ren focused on the functional analysis of transcriptional enhancers and developing a new way to look at gene regulation from a 3D point of view. Sung Hee Baek talked about chromatin modifiers in epigenetic control of cancer and stem cells. Ken Zaret talked about the initiation of cell reprogramming, specifically his efforts to identify “Pioneer factors”, these bind to target genes that will turn on later. The final talk in this session was from Ralf Huss, about using stem cells as drug delivery tools in cancer therapy, due to their propensity to home in on the tumour.

The first session of day 2 was devoted to the developmental biology of pluripotency. Ron McKay, one of the conference organisers, spoke on the need to understand the functional identity of human genomes. Many early lineage cell types can be grown, and the human ESC are all quite different. This variation needs to be understood. Sara-Jane Dunn talked about her computational work modelling pluripotency, an alternative approach to investigating this biological phenomena.

The first session after lunch was on Telomere and RNA biology. V. Narry Kim spoke first. The main focus of her lab is miRNA regulation, but in this talk she talked about RNA-binding proteins in ESC. Next up was Didier Trono who talked about how transposable elements and their epigenetic control mechanisms are key regulators of transcriptional networks in PSCs. Yue Wan talked about the importance of RNA structure in determining RNA function, and the need to learn more about RNA structure. Toshio Suda talked about the aging of hematopoietic stem cells. Stem cells are long lived, and they do change as they age. He talked about the different characteristics of “elderly” stem cells.

The first session of Friday morning was focused to Functional Genomics. Huck Hui Ng gave the first talk, on his work investigating the systems biology of stem cells. Andras Nagy talked about his groups attempts to produce a stable pluripotent cell quite different from ESCs. Frank Buchholz talked about using RNA screens to find modulators and regulators of the pluripotent state.

After lunch there was a Cancer session, covering a wide variety of aspects. Richard Young started the session with a talk about the roles of transcriptional super-enhancers in cell identity and cancer. Zsuzsanna Izavak talked about her recent Nature paper on transcription driven by primate-specific endogenous retrovirus HERVH in naïve stem cells. The HERV driven network partially explains differences in mouse and human stem cells. The significant differences between mouse and human stem cells were mentioned by several of the speakers throughout the conference. Jesse Smith talked about gene transcription inhibitors as potential cancer therapeutics. The rationale being that cancer can be viewed as a disease of de-differentiation, i.e. dysregulated gene transcription.

Yukiko Gotoh talked about regulation of neural stem / progenitor cell fate in the embryonic and adult mouse brain. The basic premise of her work is that embryonic neural stem cells build a brain, whilst adult neural stem cells have a very different role, e.g. in learning and memory.

Janet Rossant gave the closing keynote lecture. She talked about cells in early embryo development and studies looking at when developmental potential is lost, comparing the mouse and human embryo. She also emphasised the differences between mouse and human early embryo development, and the importance of remembering this when studying stem cells.

I very much enjoyed my time in Singapore – it really is a beautiful city. I attended the night safari on the conference networking trip, which was great fun and a really unique experience. I even managed to secure a gift for my children left at home – a lego “Mr Stem Cell” so I can introduce them to the delights of stem cells at an early age!

Meeting report: 4th Single Molecule Localisation Microscopy Symposium

In the shadow of The Shard at the end of August, I joined about ~120 scientists at KCL for the 4th Single Molecule Localisation Microscopy Symposium. This 3-day meeting preceded the announcement in October that the Nobel Prize in Chemistry had been awarded to Eric Betzig, Stefan Hell and William E. Moerner for their contributions to the field of super-resolution (SR) microscopy (which includes SMLM approaches), but it was already very clear from the content and calibre of the programme that the field was having an important impact on several key areas of biological research.

The Symposium was superbly organized by Helge Ewers, Mike Heilemann, Aleksandra Radenovic, and Jean-Baptiste Sibarita; the only real disappointment being the cancellation of Stefan Hell’s keynote lecture due to illness. The 25 talks were divided into 5 broad topics: technical developments; biological systems; quantitative localization microscopy; labelling; and data analysis. It was particularly pleasing from my perspective that women researchers were well-represented, with at least 1 female speaker in all but 1 of the high quality sessions. Nature Protocols authors were also among the speakers1, 2. The full schedule of talks can be found here.

I found most of the talks to be surprisingly accessible and, as someone very interested in techniques and methods, there was a lot to hold my attention. Much effort is being expended on improving SMLM with respect to: the achievable resolution, especially axially; its live imaging capabilities; and quantitative applications. Progress in all these areas largely relies on a combination of new microscope setups, improved labels, and advanced data analysis methods. Biological applications discussed included nanoscale clustering of membrane proteins (particularly during synaptic and T cell signalling in mammalian cells), molecular assembly/organisation of protein complexes, and single particle tracking. The talks were complemented by ~60 posters, which covered additional applications such as imaging the cytoskeleton, chromatin structure, and plant tissues.

But it wasn’t all hard work – attendees were treated to an evening boat trip along the Thames, which provided the opportunity for less formal discussions over dinner, whilst enjoying the beautiful views on offer. I spent a very enjoyable evening in the company of a lovely group of PhD students and young post-docs discussing life the universe and everything – and trying not to feel my age!

At the close of my 3-day crash course in SMLM, I certainly felt much better-informed and very much inspired by the amazing work being done by this small but enthusiastic community. It is certainly an area of research I will continue to follow closely – perhaps a trip to Bordeaux is on the cards for next year!

 

1. Quantitative imaging of membrane lipid order in cells and organisms Dylan M Owen,    Carles Rentero, Astrid Magenau, Ahmed Abu-Siniyeh & Katharina Gaus Nature Protocols 7, 24–35 (2012) doi:10.1038/nprot.2011.419

2. Direct stochastic optical reconstruction microscopy with standard fluorescent probes Sebastian van de Linde, Anna Löschberger, Teresa Klein, Meike Heidbreder, Steve Wolter, Mike Heilemann & Markus Sauer Nature Protocols 6, 991–1009 (2011) doi:10.1038/nprot.2011.336

 

 

 

EMBO Young Investigators

This month saw the announcement of the EMBO Young Investigators for 2014, and we’re very pleased to count three of them amongst our Nature Protocols Authors! All in all, 27 researchers from 11 different countries were selected to receive the status of EMBO Young Investigator this year; you can see the full list of researchers along with their research interests here.

So, we’d like to congratulate all the new EMBO Young Investigators, and especially our Authors Filippo Del Bene, Ines Thiele and Johannes Zuber. Ines has written two Nature Protocols on the topic of metabolism, one for generating genome-scale metabolic reconstructions and a second for predicting cellular metabolism using the COBRA toolbox, both of which have been very popular additions to our content. With a focus on RNAi, Johannes’ protocol provides a pipeline for the generation of shRNA transgenic mice, and Filippo’s protocol for converting zebrafish transgenic lines from eGFP reporters to GAL4 drivers with CRISPR/Cas9 is one of our most recent articles, published in Nature Protocols earlier this month. It’s great that we’ve been able to help disseminate the work of such up-and-coming researchers, and we’re equally pleased that they have found us the place to publish their protocols!

Decoding neural circuit structure and function workshop

The last couple of months have been busy for the Nature Protocols editors, with Mel and I both travelling a fair amount. The end of September saw me heading off to Istanbul, Turkey for the EMBO workshop on “Decoding neural circuit structure and function”. I had been thinking a little warmth at the end of September would be a welcome change, but ironically I think it was actually warmer in London! And we didn’t really need all the rain to encourage us to attend sessions, given the excellent line up.

I don’t have the space to discuss all the talks at great length, and it was also encouraging to see so much unpublished data being presented, which I must not share. But here are my highlights.

The first talk of the meeting was by Juergen Knoblich, and it was great to hear about his current projects, using Drosophila genetics to understand cell cycle exit and quiescence plus the contribution of changing the balance of asymmetric versus symmetric divisions in development of various disorders. It was also good to hear that his group are doing more work on the brain organoids, whose creation has already been explained in their group’s recent Nature Protocols article. I was intrigued that a member of the audience asked about the ethical implications of such organoids – it was a question I asked myself whilst editing our article.

I enjoyed learning about behavioural assays using the larval zebrafish from Florian Engert. It was amazing to see what such tiny organisms are capable of. Scott Waddell’s talk was focused to resolving how an animal decides to do the most appropriate thing at a particular moment, using Drosophila as his model system. This question came up as the main research question being addressed in several speaker’s talks.

Another fascinating talk was by Leslie Vosshall on “Sensory cues driving host-seeking behavior in the dengue vector mosquito Aedes aegypti”. Leslie’s groups’ research should help address whether there is any truth in the various old wives tales concerning whether particular factors increase a person’s attractiveness to mosquitos. I spoke to some other researchers about Leslie’s talk and they are confident Leslie will be able to improve the repertoire of molecular biology research tools available for the study of Aedes aegypti. So, it is worth keeping an eye on what Leslie is up to.

The final talk of the conference was by Gero Miesenböck. I very much enjoyed the pictures of the T-maze apparatus his group uses to perform mass behavioural assays in Drosophila.

In addition to all the fabulous talks there were stimulating poster sessions. I particularly liked the poster by Deniz Atasoy from Medipol University, Istanbul, on a Genetically encoded synaptic marker for electron microscopy (GESEM) that he developed. This poster was awarded the Nature Protocols poster prize, going to the poster with the best protocol behind it. Special mention should also go to Xuefan Gao from the Max Planck Institute for Developmental Biology in Tubingen, whose poster on AVEXIS, a protein-protein interaction screen assay, was also very methodological in slant.

Do let us know if you are organising a conference and would like to have a Nature Protocols poster prize as we are always looking out for opportunities to sponsor such prizes.

We also enjoyed conference dinners at Boğaziçi Üniversitesi and on a boat on the Bosphorus – so it wasn’t all work! The conference organisers, Arzu Celik, Nilay Yapici and Hernan Lopez-Schier, deserve a special thank you for being so welcoming and organising such a good program. Istanbul is a beautiful city and well worth a visit.

Modelling the molecules of life – a talk by Michael Levitt

Last thursday I went to an inspiring talk given by Michael Levitt, joint winner of the 2013 Nobel prize for chemistry. It was the 2014 Sir Ernst Chain Lecture at Imperial College, London.
Starting with a project using lysozyme (where the co-ordinates for the computer input came from a ball and stick model and were typed onto punch cards) and early computer simulations of protein folding, to his more recent work on modelling ribosomes and eukaryote chaperonins, he presented a small slice of the amazing work that he and his collaborators have done. What was equally evident was the phenomenal advances in technology and computational power over the last 40 years.
He is currently holding a chair in Cancer Research at Stanford, and spoke a little about the work on humanised antibodies that he did with Cary Queen  at Protein Design Labs (a company that formed in 1987, is now PDL biopharma  and produced patents that underpinned the development of drugs like Herceptin,and Avastin).

He included a slide providing advice to the young:
Be passionate
Be persistent
Be original
Be kind and good (or, at least, act kind and good…)

He thanked the Nobel Committee for Chemistry for, amongst other things, considering awarding a prize for a method rather than a solution; a sentiment especially pleasing to the editors of Nature Protocols!

Some of the slides were similar to those published here.
In 2003, he prepared a series of video talklets to form a web-based class on Computational Structural Biology. These are available on-line; and I plan to listen to as many of these as I can this coming weekend! Structural Biology 228 | Computational Structural Biology

Top 100 papers of all time

I recently gave a talk in Singapore about publishing in Nature Protocols. When I give these talks, one of the things I explain is why Nature Publishing Group is interested in publishing methodological information. For that specific talk I had great new supporting document – an article in the previous weeks Nature about the “top 100 papers”. Of course, my top 100 papers is going to be very different from yours, but the top 100 for the purposes of this article was defined as the most cited papers of all time in the Science Citation Index, owned by Thomson Reuters.

cover_nature

When I saw the cover of Nature that week, and learnt of the news story, I knew what the most cited articles were going to be – methods papers. I knew about the high cites because when we first launched Nature Protocols we looked to see which methods papers were most cited, indicative, we believed, of people having reproduced the method and being in need of a protocol. We found such papers had amazingly high cites. An example of one of the highly cited papers serving as the inspiration for a protocol was the paper by Piotr Chomczynski and Nicoletta Sacchi on isolating RNA. We felt it was important that users of the assay understood how and why it worked, and published a protocol on the assay by the original inventors.

It will be fascinating to look in ten years’ time to see the methods being invented now that have taken off and revolutionised the way we do research. I very much hope our protocols will have helped facilitate their adoption in new labs around the world.

Nobel success

We are a little late to the party (it has been a busy month at Protocols HQ!), but Nature Protocols would like to extend their warmest congratulations to all this year’s winners of the Nobel Prize for Chemistry: Eric Betzig, Stefan Hell and William E. Moerner for their contributions to the development of super-resolution microscopy.

We are very pleased to have published a Nature Protocol from the Betzig lab earlier this year on Bessel beam plane illumination microscopy.  And we are equally pleased that the Protocol Exchange can claim a Nobel Laureate amongst its authors; the Moerner lab has published a guide to using Easy-DHPSF to measure the precise localisations of molecules in images acquired using a wide-field DH epifluorescence microscope. I would also encourage you to visit Moerner’s very informative lab website, if only to find out about the guacamole!

Cancer genomics and beyond…

About a month ago, I attended the ‘Beyond the Genome (BTG): Cancer Genomics’ meeting in Boston—my second conference as Chief Editor of Nature Protocols (my first was the 2014 ARR meeting at the University of Sussex). The BTG meeting grabbed my attention for several reasons; firstly, the topic was more-or-less within my comfort zone; secondly, there was a heavy focus on bioinformatics (a rapidly developing and important field); and thirdly, the line-up was fantastic. Fortunately, it was a small meeting so I was able to pin down several of the key speakers, including Fred Alt, Gad Getz, Peter Park, Nuria Lopez-Bigas, Mike Schatz, Nils Gehlenborg, and Rosalie Sears, and poke my head into some of their ‘labs’ (offices).

Given the bioinformatics focus of the meeting, it is perhaps unsurprising that many of them spoke about the need for easy-to-follow instructions for biologists branching out into bioinformatics and using complex computational tools for the first time. As Peter Park pointed out to me, “biologists and bioinformaticists speak different languages, and few journals seem to bridge this gap”. As it happens, some of our most popular protocols (e.g. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources, which has been cited almost 4,000 times) fit this bill perfectly, providing clear, concise, step-by-step instructions for using popular bioinformatics tools and software packages. Written by bioinformaticists, reviewed by biologists and bioinformaticists, and edited by (ex)-biologists, these protocols help researchers without a bioinformatics background to analyse their data themselves. Given the popularity of these protocols and the growing importance of bioinformatics within the biomedical sciences, this seems like an obvious area for expansion of the journal (so watch this space!)

With more than 40 talks over three days, a comprehensive overview of the BTG meeting is out of the question, so instead I will highlight a few of my favourites. On the first day, Gad Getz opened the talks with a discussion of MutSig, software developed by his group for identifying driver genes (by analysing lists of mutations and territory covered during sequencing) and building models of the background mutation processes that occur during tumour formation. Jan Korbel stepped up next, explaining the system he has set up for constructing maps of unbalanced SVs based on whole-genome DNA sequencing data. At the end of the day, Fred Alt spoke passionately about his recent work on high-throughput genomic translocation sequencing (HTGTS) strategies to identify translocations that arise from fixed DSBs, as well as sites of endogenous genomic DSBs.

The next morning, Rosalie Sears gave an interesting talk on the use of 3D tissue bioprinters to generate tumour tissue, as well as normal healthy tissue. Printer ‘ink’ 1 (consisting of endothelial cells, fibroblasts, and immune cells) is added first; printer ‘ink’ 2 (cancer cells) is then added to look at tissue distribution and assess cell interactions. Unfortunately, only a few labs have access to such advanced technology but this could change over the next 5-10 years as the set-up cost inevitably drops. Later that day, Lynda Chin talked about data she has collected using ChromHMM, a computational tool set up by Ernst and Kellis for predicting chromatin ‘states’ (based on combinations of chromatin marks) and characterizing their biological functions.

On the third and final day, Mike Schatz kicked off proceedings with a ‘bioinformatics challenge’ relating to single-cell CNV analysis. The challenge has become a regular feature of ‘Beyond the Genome’, encouraging students, postdocs, and analysts in the audience to get “down and dirty with data to solve an informatics problem as quickly as possible” (as Mike puts it). The task opened up to the floor this year (“should you chose to accept it”) was to resolve the population structure of a collection of cells, establishing which cells were in the same clone and, for each clone, which was the most highly amplified oncogene. Schatz provided data for simulated single-cell sequencing of a population of 9 cells with 250k reads per cell (all for chromosome 1 (not whole genome) at 0.5x coverage). He also provided a list of 100 candidate genes. As this coverage is too sparse to identify point mutations, the genome must be divided into “bins” (with 50-100 reads/bin) before mapping the reads and counting reads/bin. Copy number variations in a single cell can be identified as bins with significantly fewer or significantly more reads, and the population structure can be examined by finding cells with the same patterns of bin counts (see our Protocol on this).

After an initial flurry of activity as people rose to the challenge, the audience settled down to hear Nils Gehlenborg talk about the applications of StratomeX, an interactive visualization tool he developed in the Park lab to compare differences in molecular profiles across patient sets. Then, just after lunch, Schatz announced the winner of the bioinformatics challenge; René Böttcher, a PhD student in Guido Jenster’s lab in the Erasmus Medical Center in Rotterdam, was the first to solve the challenge in just over 1 hour (congratulations, René!). As the closing remarks drew nearer, Peter Park gave a fantastic talk on the different single-cell sequencing approaches, with tips on comparing data obtained using different techniques (e.g. GC bias with MDA sequencing, better read-depth stability at large scales with MALBEC than with MDA, more consistent depth at small scales with MDA, etc.). Shortly afterwards, a sea of smiling faces left the auditorium, with many promises to return next year.