Abstract
Nitrogen (N) is the most important nutrient for the growth of potato (Solanum tuberosum L.). Foliar gene expression in potato plants with and without N supplementation at 180 kg N ha−1 was compared at mid-season. Genes with consistent differences in foliar expression due to N supplementation over three cultivars and two developmental time points were examined. In total, thirty genes were found to be over-expressed and nine genes were found to be under-expressed with supplemented N. Functional relationships between over-expressed genes were found. The main metabolic pathway represented among differentially expressed genes was amino acid metabolism. The 1000 bp upstream flanking regions of the differentially expressed genes were analysed and nine overrepresented motifs were found using three motif discovery algorithms (Seeder, Weeder and MEME). These results point to coordinated gene regulation at the transcriptional level controlling steady state potato responses to N sufficiency.
Similar content being viewed by others
Introduction
Potatoes (Solanum tuberosum L.), constituting the third most grown crop worldwide, usually have a sparse and shallow root system, and are therefore particularly sensitive to abiotic factors such as water and nutrient availability1. The macronutrient nitrogen (N) positively impacts potato biomass, tuber yield and quality, especially in fields with a limited natural supply2,3. However, excessive application of N can have two main undesirable effects: 1) decreased quality of the tubers which can render them less suitable for industrial food production4 and 2) leaching of nitrate into water supply systems and the emission of nitrous oxide, both of which can cause environmental damage3. Therefore, long-standing goals within the potato production sector are to increase plant N use efficiency as well as develop sustainable N management systems to optimize N supplementation to the amount required to maintain plant growth and achieve target yields1,2.
Whole transcriptome analyses using RNA-seq to examine genes involved in N deficiency responses have been done in maize5, Arabidopsis6, cucumber7 and rice8. Both a well-annotated reference genome and a reference transcriptome (gene models) are needed to carry out differential gene expression analysis using RNA-seq. The potato reference genome and transcriptome were initially published in 20119,10. In 2012, a new annotation system (ITAG1.0) with updated gene models generated using new data from the tomato reference genome, was made available for both potato and tomato11,12. These resources have fuelled a renewed effort to analyse the molecular response of potato under different biotic and abiotic conditions13.
N related regulatory motifs have been identified in maize and Arabidopsis thaliana genes14,15 and point to coordinated responses to nitrate at the transcriptional level in plants. One of these motifs, the Nitrate Related cis-Element (NRE) identified in Arabidopsis, has also been found in the promoter region of the Nitrite Reductase (NIR) gene of several monocotyledonous and dicotyledonous plants such as spinach, tobacco, rice, maize and sorghum15. However, few studies have been done on regulatory motifs in the upstream regions of genes in potato. A study analysing the level of expression of a transgenic patatin class-I and β-glucuronidase (GUS) chimeric gene in field-grown potato found significant changes in expression depending on the promoter used in the construct and the regulatory motifs it contained16. These results highlight the need to further study and understand potential gene regulation mechanisms in potato, especially in response to critical abiotic factors such as N sufficiency.
Additionally, transcriptome analysis can be used for the discovery and annotation of overrepresented motifs. De novo motif discovery algorithms such as Seeder17 have been used before to predict the binding sites of regulatory elements in the upstream flanking regions of genes in other plant species18,19. Differences in the 5′-upstream flanking regions of potato genes, including variations in the number and types of regulatory motifs, have also been correlated with changes in gene expression16.
Transcriptome analysis can also be applied as an alternative method for quantifying N sufficiency20,21,22. Expression profiles associated with N sufficiency can be used to guide decisions on N fertilizer application in potato fields. Other technologies proposed for nutrient monitoring in crops include biosentinel plants that use promoters from nutrient responsive genes to drive reporter genes23. Both of these approaches can be enhanced through transcriptome analysis.
The current study uses RNA-seq data generated from three commercial potato cultivars (Shepody, Russet Burbank and Atlantic) to examine the steady state transcriptome response of potato to N supplementation. Genes with expression that was affected by the rate of supplemented N were further analysed for overrepresented DNA motifs in their upstream flanking regions through de novo motif discovery analysis. In all, 39 genes were differentially expressed in all three cultivars, and in total, nine potential nitrogen responsive motifs were identified.
Results
Effects of N supplementation on potato plants
The availability of N in the soil is known to cause measurable changes in certain characteristics of potato plants including dry biomass at harvest, fresh tuber yield, and chlorophyll content3. To determine the effects of N sufficiency, two contrasting rates of N supplementation were applied (0 kg N ha−1 and the recommended rate of 180 kg N ha−1) in a randomized complete block design (Table 1). Four replicated blocks were used. All trait measurements were statistically tested with a two-factor Analysis of Variance to determine the significance of the observed changes among plants from different cultivars grown under different N supplementation rates (Table 2, Supplementary Tables 1 and 2).
The chlorophyll content index was measured in foliar tissue samples collected from the field grown plants using Special Products Analysis Division (SPAD) readings. Plants without N supplementation had significantly lower SPAD readings than those grown with supplemented N (Fig. 1a). This result indicates that plants grown without added N had lower concentrations of chlorophyll in their foliar tissue, which is indicative of reduced N sufficiency. The SPAD readings among plants of different cultivars were also significantly different.
Petioles were collected from the same leaves used for the SPAD readings and the concentration of petiole nitrate was chemically determined for each biological replicate. Petioles collected from plants without supplemented N had significantly lower concentrations of petiole nitrate than did the plants grown with the addition of N (Fig. 1b). There were no significant differences in the petiole nitrate concentrations among plants of different cultivars.
The effects of N on biomass were measured before vine desiccation. To calculate the effect of N supplementation on biomass, whole plants were sampled and partitioned into vines, tubers, stolons and readily recoverable roots. Using information on the spatial distribution of plants in the field as well as the dry matter weight of the sampled plants, total biomass per hectare was calculated (Fig. 1c). The results show that biomass significantly increased in the groups grown with supplemented N. Dry matter weight also differed significantly among the three cultivars. However, the differences in biomass between cultivars were less pronounced than those observed due to N supplementation.
Finally, fresh tuber yield measured at harvest was analysed to determine the effect of N supplementation on yield. Tubers from the plants collected at harvest for biomass determination were also washed and weighed before drying. The total fresh yield for tubers was calculated using additional information on the spatial distribution of plants in the field (Fig. 1d). Tuber yield varied significantly among the three cultivars. N supplementation also had an observable effect on fresh tuber yield, although not as pronounced as the cultivar effect.
Transcriptome sequencing reveals N responsive genes
Sequencing of RNA samples was carried out to determine the differences in the transcriptomes of plants grown under two N treatments, deficient and sufficient, to obtain a list of N responsive differentially expressed genes. Total RNA was extracted from the foliar tissue samples collected from the same apical leaflet used for SPAD readings and petiole sampling. Paired-end sequencing (2 × 100 cycles) of the prepared RNA libraries was performed using a HiSeq 2000 [Illumina]. Sequencing results were trimmed and filtered for quality, and aligned to the potato reference genome10 using TopHat24.
The transcriptomes of plants grown with and without supplemented N at 180 kg N ha−1 were compared to find differences in gene expression that were highly specific to N status and that were consistent across cultivars and developmental time points. Samples were collected from three cultivars (Shepody, Russet Burbank and Atlantic) at two developmental time points: eight and ten weeks after planting. Sampling of the four replicated blocks was done over the course of a day (i.e. 0800 h, 1100 h, 1400 h and 1700 h for blocks 1 to 4 respectively) to allow for the removal of genes with significant time of day variation in expression in the analysis.
Aligned reads were analysed using CuffDiff25 and lists of differentially expressed genes in plants grown under different N treatments were made for each cultivar and developmental time point. Gene lists were compared to find genes with similar expression patterns across all cultivars and developmental time points. The experimental design was focused on identifying genes involved in steady state responses to N supplementation. Differentially expressed genes that were common among the cultivars and development time points were divided into two groups: those over-expressed in plants grown with supplemented N and those under-expressed in plants with supplemented N. A summary of the number of genes found to be differentially expressed in each analysis can be found in Supplementary Table 3.
In all, 30 genes were consistently over-expressed (Table 3) and nine genes under-expressed (Table 4) with N supplementation, across cultivars and over developmental time points. Differences were found in gene expression among cultivars and developmental time points, but they were not the focus of the current study. For example, 51 genes were over-expressed in only one of the two time-points and another four genes were under-expressed in only one of the two time-points (Supplementary Table 4). Alternative splicing analysis was also performed on the raw RNA-seq data, but did not reveal alternate splicing events in the 39 genes found to be N responsive in all cultivars (data not shown).
To validate the differential expression of the shared genes, the RNA samples were tested again using an nCounter Digital Analyzer (Nanostring Technologies, Inc.). Probes were designed for the 39 differentially expressed genes previously identified through RNA-seq. The nCounter reads were normalized using the expression of five housekeeping genes as a reference26,27. A Spearman Rank correlation was used to compare the measurements obtained from the nCounter Digital Analyzer with previous data generated through RNA-seq (Fig. 2). A positive correlation (r = 0.79) was found between the methods.
The N responsive genes were annotated using Gene Ontology (GO) terms and KEGG pathway information to determine if they have known functions in common. The GO information was obtained from the collected results of five GO term analysis pipelines (Trinotate (HMM and BLAST), OrthoMCL-UniProt, BLAST2GO, Phytozome and InterPro2GO) on ITAG1.0 genes28. GO terms were found for all but six of the N responsive genes. The KEGG pathway information and Enzyme Commission (E.C.) numbers of the genes were obtained from the SolCyc database in the Sol Genomics Network12. Out of the ten genes with associated E.C. numbers, seven were part of well-defined metabolic pathways (Tables 3 and 4): Alanine-glyoxylate aminotransferase (Sotub10g018540), Glutamate decarboxylase (Sotub01g005580), Glutathione S-transferase (Sotub10g024560), Sulfate adenylyltransferase (Sotub09g024290), Proline dehydrogenase (Sotub02g033320), Primary amine oxidase (Sotub08g025870), and Ubiquinone/menaquinone biosynthesis methyltransferase (Sotub12g020880).
Most of the KEGG pathways associated with the differentially expressed genes are involved in amino acid metabolism. In addition, one gene is directly associated with sulphate reduction pathways (Sulfate adenylyltransferase Sotub09g024290), as well as three other sulphate-related genes (Sulfate transporter Sotub04g021910; High affinity sulfate transporter 2 Sotub04g027100; High affinity sulfate transporter 2 Sotub10g013960) that have been previously reported to have an N response in Arabidopsis and tobacco29,30. The Gene Ontology analysis revealed a very wide variety of GO-terms associated with the differentially expressed genes. The most commonly found GO term was related to an integral component of the cell membrane (GO: 0016021). Other GO terms associated with the 39 differentially expressed genes included response to cadmium ion (GO:0046686), oxidation-reduction process (GO:0055114) and transmembrane transport (GO:0055085).
Overrepresented sequence motifs are present in the upstream flanking regions of nitrogen responsive genes
To predict potential N responsive regulatory mechanisms in the 39 differentially expressed genes, the 1000 bp upstream flanking regions were analysed for putative regulatory DNA motifs. De novo motif discovery tools Seeder17, Weeder31 and MEME32 were used to identify overrepresented motifs in the upstream flanking region of the differentially expressed genes. Each program works by implementing a completely different motif prediction algorithm, and analysing the sequences using all three tools increases the probability of finding more overrepresented motifs.
To discover overrepresented motifs, both Weeder and Seeder require a background file that contains information on k-mer frequencies in the relevant region or regions of the genome. Because motif discovery was focused on the upstream flanking region of N responsive genes, the reference background was computed from the sequences of the upstream flanking regions of all the genes in the Potato Reference Genome10. These sequences were obtained using the Transcription Start Site (TSS) and strand information for all the genes found in the ITAG1.0 annotation system11,12.
Seeder and MEME assume that overrepresented motifs will appear in all or most of the input sequences. This means that if one or several sequences do not contain an overrepresented motif, it might not be discovered. By carrying out multiple runs of motif discovery separately in smaller random subsets, there is an increased probability of finding motifs that are not present in all the sequences of the original list. Therefore, the two lists of upstream flanking regions of N responsive genes were sub-divided into smaller random sub-groups of 10 sequences and these smaller subsets were used as input for MEME and Seeder. Due to the nature of its algorithm, it is not necessary to use random sub-groups with Weeder and therefore a single list of upstream flanking regions was used as input for this program.
It is common for motif discovery programs to predict motifs that are very similar to each other, making it difficult to distinguish redundant and unique hits. This problem is further complicated when using random sub-groups as input, because the same motif might be present in several different sub-groups and will therefore appear multiple times in the final output. To overcome this issue, predicted motifs were clustered using the k-medoids clustering algorithm found in the TAMO package33 and the average of every cluster was then used as a representation of that cluster. The final motifs discovered in both lists of differentially expressed genes, after clustering, are summarized in Table 5.
Two databases of experimentally validated DNA-motifs, JASPAR34 and PLACE35, were consulted to determine whether any of the predicted motifs had previously reported functions that might have any relationship with N response. All discovered motifs returned at least one significant result from each of these databases, however none of those results matched the computationally discovered motif identically. Most of the results found in the databases differed with the discovered motifs by one nucleotide in the aligned region.
The results obtained from searching for motifs in the PLACE database can be used to predict regulatory mechanisms in potato because this database aggregates only experimentally validated motifs in plants. The matches from PLACE had several associated biological functions including the regulation of histone, auxin, and amylase genes (Table 5). A regulatory motif similar to one discovered by Weeder (Motif 3) has been previously shown to regulate patatin production in potato tuber, possibly through modulation with exogenous sucrose36. Also, three motifs discovered by Seeder (Motifs 5, 6, and 9) matched with PLACE motifs that have been associated with light-responsive promoters: 3AF137, S1F38 and GAPB39.
Because the NRE motif was described in previous studies15 but was not found with our motif discovery process, an attempt was made to find instances of this motif in the upstream regions of our 39 N responsive genes. None were found. The NRE motif was identified in the promoters of NIR genes in plants, however no NIR genes were identified as N responsive in the current study, because developmental stage and time of day variable genes were removed. The upstream flanking regions of three potato NIR genes (Sotub01g045870, Sotub08g028180, and Sotub10g014930) were therefore inspected and the NRE motif was indeed found in two of those regions. This indicates that the NRE motif is conserved in the upstream regions of the NIR genes in potato and therefore possibly involved in N response specific to developmental stage or time of day. However, there is no evidence in the current study to suggest that it regulates the steady state N responsive genes identified here.
Every discovered motif was mapped back to all upstream flanking regions in the N responsive genes. This served two purposes: it enabled the identification of additional redundant motifs, which mostly mapped to the same positions within the flanking regions, and it also permitted the comparison of upstream flanking regions from genes with similar reported functions. Figure 3 contains diagrams showing the position of all motif instances in the 5′-upstream flanking regions of all N responsive genes. In general, the motifs that appear the most times in the upstream-flanking regions of all N responsive genes are Motif 4 [ATGCGG] (62 times), Motif 5 [A.ATrGAC] (56 times) and Motif 7 [CATAGG] (40 times).
Discussion
The effect of N on the growth and yield of potato has been the subject of several studies, including those focused on the phenotypical effects of N-deficiency40,41 and more recently, those that have analysed the effects of N on the expression of a small subset of genes20,21,26. However, a modern tool like RNA-seq offers a new way to compare the whole transcriptome of potato plants grown under differing N status. Our experiment is one of the first to use RNA-seq to find differentially expressed genes in plants grown in the field, with and without N supplementation.
The current study found that the three potato cultivars examined responded in the same way to supplemented N. The results were increased tuber yield, significantly greater N uptake and dry biomass, as well as greater leaf chlorophyll content. Gene expression analysis identified genes responding to N sufficiency similarly across three cultivars and two developmental time points. Previous studies have shown that genes can be variably responsive to N depending on the time of day42. The experimental design of the current study removes genes with time of day variation, and leaves only genes that consistently show differential expression. The final set of 39 identified genes were those that showed a consistent response to N supplementation in all three cultivars throughout the day, both at eight and ten weeks after planting.
Our previous studies examined expression of potato genes involved in N uptake, assimilation and transport, and demonstrated that an ammonium transporter gene, AMT1, was responsive to N rates over different developmental time points21. However, this gene was found to have variation in expression at different times of the day42 and therefore did not meet criteria for screening in the current study. Functional analysis of the differentially expressed genes in the current study indicated association with KEGG pathways involved in amino acid metabolism. Two over-expressed genes included the aminotransaminases Aminotransferase-like protein (Sotub12g011100) and Alanine-glyoxylate aminotransferase (Sotub10g018540). Additionally, Proline dehydrogenase (Sotub02g033320) was found to be under-expressed with N supplementation. Decreased proline dehydrogenase activity was also found under conditions of N supply in French bean43, which concurs with the results of this study. The action of proline dehydrogenase and pyrroline-5-carboxylate dehydrogenase lead to degradation of proline to glutamate.
There is also evidence for regulation of C:N balance with N supplementation. Over-expressed genes encoded enzymes functioning in both C and N metabolic pathways including two enzymes in the GABA shunt: Aminotransferase-like protein (Sotub12g01110) and Glutamate decarboxylase (Sotub01g005580). The GABA shunt is involved in the regulation of C:N balance in plants. GABA may also have roles related to stress response and as a signalling molecule44. Another gene involved in both C and N metabolism was Alanine-glyoxylate aminotransferase (Sotub10g018540), which converts alanine and glyoxylate to glycine and pyruvate. This enzyme is involved in photorespiration in Arabidopsis45 and high levels of photorespiration are associated with low alanine and high glycine46. Interestingly, photorespiration is also linked to increased nitrate assimilation47.
Additionally, four sulfate-related genes were found to be N responsive, one of which was part of the sulfate reduction and sulfate activation pathways, indicating a potential relationship between the sulfate and nitrate metabolic pathways. This type of relationship has been observed before in tobacco where N availability has been shown to regulate ATP sulfurylase29 and, more recently, in Arabidopsis where sulfur transporter SULTR1;1 is found to be down-regulated in conditions of N insufficiency30.
GO-terms associated with N responsive genes were found to be related to transmembrane transport. These genes were involved in transport of sulfate, nitrate, amino acids and peptides; which correspond with the differential expression observed in amino acid and sulfate metabolism genes. Under-expression of a cation transporter and over-expression of a cation transport regulator were also found. These results indicate that proton movement may be involved in responses to N supplementation. Cation transport can affect the activity of the GABA shunt enzyme Glutamate decarboxylase, which is controlled by pH and Ca2+-calmodulin48.
The Flowering locus T protein gene (Sotub05g028860) in potato plays a role in controlling maturity and tuberization49. This gene was under-expressed in plants with supplemented N, suggesting that increased N sufficiency can delay maturity. Two genes that were also under-expressed are similar to Arabidopsis genes involved in N response: a nodule inception protein similar to the Arabidopsis NIN-like transcription factor (Sotub01g049920) and Nodulin (Sotub05g012720). The former is a nitrate responsive transcription factor50 and the latter participates in nodule formation in legumes, and in amino acid transport in non-leguminous plants51.
The N responsive genes were shown to have shared motifs in the upstream promoter regions, which supports the idea of coordinated regulation at the transcriptional level of genes responding to N supplementation. Motif discovery was carried out based on a previously described strategy that sub-divides genes into random subgroups to increase the probability of finding overrepresented motifs that are not present in all flanking regions (Munusamy et al., unpublished). Initial results of the motif discovery algorithms produced many redundant motifs, which was expected because different sub-groups may contain instances of the same motif. Therefore, a k-medoids clustering strategy was used to reduce the incidence of redundant motifs. In the end, the average motif of each cluster was taken as the representative motif and used in the subsequent annotation and mapping analyses.
Annotation of discovered motifs using experimentally validated motif databases, especially PLACE, revealed several intriguing putative regulatory mechanisms. Three of the overrepresented motifs in the upstream regions of N-responsive genes, including Motif 5 which was one of the motifs most frequently found in this dataset, were similar to motifs previously found to be associated with light-responsive genes37. Additionally, Motif 3 [GAGTAT/ATACTC] is very similar to a previously reported motif [AATACTAAT] involved in the regulation of patatin production in tubers as a response to exogenous sucrose36. These results are very similar to a previous study focusing on the regulation of patatin genes in potato16, which further suggests that C:N balance regulation is involved in the response to N supplementation.
The motif discovered by Seeder in the 5′-flanking region of under-expressed genes (Motif 9 TACCAC) is very similar to the binding site of transcription factor S1F [TACCAT], a cis-regulatory element associated in the down-regulation of plastid related genes such as rbcS, cab, and rp121.The introduction of this binding site into transgenic tobacco plants has been experimentally shown to cause the differential repression of the rps1 plastid gene in non-photosynthetic tissue38. The similarity of the predicted motif and the experimentally validated one suggests that lower concentrations of available N could be potentially triggering a repression of plastids in the foliar tissue.
Mapping of predicted motifs in the upstream flanking regions of N responsive genes revealed few obvious patterns in the incidence of motifs with relation to the Transcription Start Site (TSS). Motifs seem to have no defined position within the upstream flank, and there are several instances of motifs appearing multiple times within the same region, which is not uncommon in other organisms. The upstream flanking regions of two over-expressed genes (Sotub11g007110 and Sotub11g007090) stand out due to their similarity, both in number and location of predicted motifs. Interestingly, both genes have the exact same annotation in the ITAG1.0 system (protein of unknown function) and are located close to each other on the same chromosome. The similarity of the positions of the discovered motifs in the upstream-flanking regions of these genes indicates a similar molecular mechanism regulates the expression of both, and also suggests that they share very similar, if not identical function. By analysing these genes and searching for a similar distribution of motifs in the upstream flanking region of at least one other gene, it might be possible to finally identify the function of these unknown proteins as well as predict their relationship with N supply in potato.
Upstream regulatory motifs in N responsive genes have previously been found in other species, such as maize14. These genes included nitrate, nitrite and ammonium transporters; nitrate and nitrite reductases; as well as glutamate and glutamine synthases. Nitrate transporter was the only gene in common between the maize study, which identified genes expressed at 4 h after N treatment, and the present study, which identified genes expressed at steady state nitrogen supplementation. Therefore, it is not surprising that the putative regulatory motifs found in our study are different than those in the study in maize.
There was no observed similarity between the motifs found in the upstream regions of the 39 N responsive genes in potato and the NRE motif found in the upstream region of the nitrite reductase gene NIR1 of Arabidopsis and several other plants15. The NIR genes were not among the 39 genes that were differentially expressed in the current study. Previously, the authors have shown that a potato NIR gene had time of day42 and developmental stage21 variations and this is likely why it is not among the 39 genes. The current study focused on genes without time of day variation, and of longer term responses to N supplementation, with gene expression measurements at eight and ten weeks after planting. Additionally, because the experiment was done in the field, plants that did not receive N supplementation were not completely starved of N. The results demonstrate that longer term, steady state responses to N involve a different set of genes than shorter term, time of day variable responses, hence, regulatory motifs are also different. However, the presence of NRE in the upstream region of NIR genes in several different plant species (including potato) raises the possibility that the putative motifs found in this study could also be present in the N responsive genes of other plants.
In conclusion, our study provides evidence for regulatory coordination of steady state responses to N sufficiency at the level of gene expression in potato. These results have many potential applications including development of N status monitoring systems.
Materials and Methods
Plant Materials and Growth Conditions
Potato plants were propagated in the field at the Fredericton Research and Development Centre of Agriculture and Agri-Food Canada, Fredericton NB, Canada in 2012. The experiment included two fertilizer N rates (0 and 180 kg N ha−1) in a randomized complete block design with four blocks. Fertilizer N was banded at planting as ammonium nitrate (34-0-0). All plots also received 150 kg ha−1 of P2O5 and K2O banded at planting. Plots were six rows (5.46 m) by 8 m in size where the outer rows were guard rows.
The experiment was planted on May 23 using 0.91 m row spacing and 0.3 m within-row spacing. A modified planter was used to band the fertilizer treatments and open the rows. Hand-cut 50 g seed-pieces each of cultivar Atlantic (U.S. Department of Agriculture, 1978), Russet Burbank (L. Burbank, approx. 1880) and Shepody (Agriculture Canada-New Brunswick, 1969) were hand-planted and imidacloprid was applied to control for Colorado potato beetle. The seed-pieces were covered using discs. One hill of cv. Chieftain was planted at the end of each row to avoid edge effects.
Sampling was done on July 25 and August 8. At each sampling, the apical leaflet of the last fully expanded leaf (usually the fourth leaf from the top of the plant) was sampled from each of 15 randomly selected plants in each plot and placed into 50 ml Falcon tubes. The tube was immediately placed in liquid N, and stored at −80 °C until RNA extraction. Petioles were then collected from the same leaf for determination of petiole nitrate concentration.
Leaf Chlorophyll Measurement and Petiole Nitrate Determination
Leaf chlorophyll index (LCI-S) was measured on the apical leaflet of the last fully expanded leaf, which was also used to measure gene expression, using a SPAD-502 reader (Konika Minolta). The LCI-S was determined in the section of the leaf midway from the mid-rib to the leaf margin41. Petioles were dried at 55 °C and ground to pass a 2 mm screen. A 0.2 g subsample of petiole tissue was extracted with 40 ml distilled water and a 15 min shaking time. The concentration of NO3-N in the extract was determined colorimetrically using a Quikchem 8500 flow injection analyzer (Lachet) using QuikChem method 90-107-04-2-A41. Two-factor ANOVA of petiole nitrate determination and SPAD readings between the groups was calculated using the R statistical language v. 3.1.1.
Dry Biomass and Fresh Yield Determination
Whole plants were sampled before vine desiccation to determine their dry biomass content and at harvest to measure fresh tuber yield. Each plant was separated into vines, tubers and stolons as well as any recoverable roots. Tubers with a diameter below 0.5 cm were left as part of the stolons and roots. Vines, stolons and roots were washed, weighed and then oven-dried. After drying, they were weighed again and the dry matter of each sample was determined. Tubers were washed and weighed to determine fresh yield. Finally, tuber samples were taken from every experimental group and quartered along the long axis. One quarter of every tuber was randomly selected and sliced into 1 × 1 cm strips, and then weighed before and after oven-drying to determine dry matter content52. Two-factor ANOVA of dry biomass and fresh tuber yield between the groups was calculated using the R statistical language v. 3.1.1.
RNA Extraction
Leaf tissue was ground to a powder in liquid N using a mortar and pestle. Samples were pre-extracted in 1 ml Hot Borate Buffer26. The lysate supernatant (200 μl) was used for RNA extraction with the Biomek NXP Laboratory Automation Workstation (Beckman Coulter) using the RNAdvance Tissue Kit (Agencourt) according to the manufacturer’s instructions for liquid samples. The RNA concentration and quality were determined using a NanoDrop 1000 Spectrophotometer (Thermo Scientific) and 2100 Bioanalyzer (Aglient Technologies) respectively.
Library Preparation and Sequencing
Libraries were generated using the TruSeq RNA kit (Illumina). Messenger RNA was purified from 1 μg of total RNA using oligo-dT beads. The mRNA enriched fraction was reverse transcribed to generate cDNA fragments that were sheared to yield ~200 bp fragments. Following end-repair, 3′ end adenylation steps, and index ligation, a PCR amplification step was performed. A separate index was used for each N treatment-variety-replicate combination for a total of 24 indices.
Two lanes of sequencing were done for each 24 index multiplex, one for each time point. The quality of the library was assessed on a DNA 1000 chip and quantified by qPCR. Libraries were subjected to 100 bases of sequencing on a HiSeq 2000 (Illumina) instrument in paired-end mode. Initial quality control of the data was performed using the software included with the sequencer.
Genome Alignment and Differential Expression Analysis
Output from the sequencer was aligned to the S. tuberosum reference genome v3_2.1.1010 using the TopHat software suite v. 2.0.925 with the mode for “fr-unstranded” library types. The quality of the alignments was verified using the ‘flagstat’ tool from the SAMtools software suite v. 0.1.1953.
Reads were assembled into transcripts with CuffLinks v. 2.1.154, using the S. tuberosum ITAG1.0 annotation file obtained from the Sol Genomics Network11,12. Transcriptome assembly was performed with the ‘multi read correct’ and ‘fragment bias correct’ modes activated. Finally, assembled transcripts from different replicates and treatments were merged into a single reference transcriptome for each variety using the ‘CuffMerge’ tool included in CuffLinks.
Differentially expressed genes were identified for each time-point and each cultivar using CuffDiff55. The same S. tuberosum reference genome as well as the single, merged transcriptome were used as reference for differential gene expression. Finally, genes found to be differentially expressed in each cultivar, were compared using custom perl scripts and a single list of over-expressed and under-expressed genes found in all cultivars at both time-points was produced.
Expression Analysis using nCounter Digital Analyzer
The 39 genes with significant differences in expression from the CuffDiff analysis were selected and used to validate the gene expression results. The same RNA samples indicated above were prepared using the reagents and method described in Geiss et al. 2008 for the nCounter (Nanostring Technologies) multiplex gene expression analysis. The nCounter data was adjusted according to the manufacturer’s instructions using the manufacturer-provided spiked positive and negative controls. Gene expression of five housekeeping genes 18S rRNA, actin, cyclophilin56, elongation factor 1-α (EF-1-alpha)57 and cox1-B20 was also measured and the geometric mean of their expression was used to normalize gene expression values for the 39 test genes26,27. Spearman rank correlation was performed using SYSTAT v. 13 (Systat Software) and used to compare expression data from nCounter and transcriptome sequencing for the 39 genes.
De Novo Motif Discovery
A FASTA file containing the 1000bp upstream flanking regions upstream of the transcriptional start sites for all the genes in the S. tuberosum reference genome v3_2.1.1010 was generated using the ‘faidx’ tool of the SAMtools software suite v. 0.1.1953. The transcription start-site and strand information for every gene were obtained from the ITAG1.0 annotation file11,12. From this general file, two subsets were created, each containing only the upstream flanking regions of the genes that were found to be significantly over-expressed or under-expressed in response to supplemented N. Motif discovery was performed separately for each set of differentially expressed genes.
Three different programs were used for de novo motif discovery: Seeder v. 0.0117, MEME v. 4.10.032 and Weeder v. 2.031. All three motif discovery programs were run simultaneously on Guillimin, Mcgill University’s high-performance computing server (http://www.hpc.mcgill.ca/), using high-memory computing nodes. A series of 5000 random subsets containing 10 random promoters each were generated for each differentially expressed gene list these random subsets were then used as input for the motif discovery programs Seeder and MEME.
The FASTA file containing all 1000 bp upstream flanking regions in the genome was used to generate the background files in Seeder and Weeder. All motif discovery programs were run to find motifs with a minimum length of 6 bp in both the forward and reverse-complement strands.
Significance of predicted motifs was determined differently for each algorithm, based on the available parameters reported by the program: in Seeder, a maximum q-value of 0.05 was allowed; in MEME, a maximum e-value of 0.001 was allowed; finally, only the top five results of each Weeder run were considered significant.
Motif Annotation and Mapping
All significant motifs were converted into the same format for comparison and annotation using the TAMO software suite33. Redundant motifs were clustered together using k-medoids algorithm, as implemented in TAMO. Cluster averages were uploaded to STAMP58 for visualization and to search in the PLACE35 and JASPAR34 databases for potential matches.
Motif cluster averages were mapped to the promoters of differentially expressed genes using the ‘Sitemap’ tool provided in TAMO. The same approach was used to map the previously reported NRE motif. Mapping results were visualized using the ‘GenomeDiagram’ tool included in BioPython v.1.6159. Diagrams for visualization of nucleotide frequencies in motifs were all created using Weblogo v.2.860.
To facilitate the comparison of the promoters of genes with similar biological function, differentially expressed genes were annotated using GO terms based on the results obtained by Amar et al.28. Additional KEGG pathway information for differentially expressed genes, when available, was retrieved from the ‘SolCyc’ database for S. tuberosum in the Sol Genomics Network12.
Additional Information
How to cite this article: Gálvez, J. H. et al. The nitrogen responsive transcriptome in potato (Solanum tuberosum L.) reveals significant gene regulatory motifs. Sci. Rep. 6, 26090; doi: 10.1038/srep26090 (2016).
References
Birch, P. R. J. et al. Crops that feed the world 8: Potato: are the trends of increased global production sustainable? Food Secur. 4, 477–508 (2012).
Westermann, D. T. Nutritional requirements of potatoes. Am. J. Potato Res. 82, 301–307 (2005).
Zebarth, B. J. & Rosen, C. J. Research perspective on nitrogen BMP development for potato. Am. J. Potato Res. 84, 3–18 (2007).
Long, C., Snapp, S., Douches, D. & Chase, R. Tuber yield, storability, and quality of Michigan cultivars in response to nitrogen management and seedpiece spacing. Am. J. Potato Res. 81, 347–357 (2004).
Humbert, S. et al. Genome-wide expression profiling of maize in response to individual and combined water and nitrogen stresses. BMC Genomics 14, 3 (2013).
Vidal, E. A. et al. Integrated RNA-seq and sRNA-seq analysis identifies novel nitrate-responsive genes in Arabidopsis thaliana roots. BMC Genomics 14, 701 (2013).
Zhao, W. et al. RNA-Seq-based transcriptome profiling of early nitrogen deficiency response in cucumber seedlings provides new insight into the putative nitrogen regulatory network. Plant Cell Physiol. 56, 455–467 (2015).
Yang, W. et al. Transcriptome analysis of nitrogen-starvation-responsive genes in rice. BMC Plant Biol. 15, 31 (2015).
Massa, A. N. et al. The transcriptome of the reference potato genome Solanum tuberosum group Phureja clone DM1-3 516R44. PLoS One 6, 1–8 (2011).
The Potato Genome Sequencing Consortium. Genome sequence and analysis of the tuber crop potato. Nature 475, 189–195 (2011).
The Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).
Fernandez-Pozo, N. et al. The Sol Genomics Network (SGN)–from genotype to phenotype to breeding. Nucleic Acids Res. 43, 1–6 (2014).
Gong, L. et al. Transcriptome profiling of the potato (Solanum tuberosum L.) plant under drought stress and water-stimulus conditions. PLoS One 10, 1–20 (2015).
Liseron-Monfils, C. et al. Nitrogen transporter and assimilation genes exhibit developmental stage-selective expression in maize (Zea mays L.) associated with distinct cis-acting promoter motifs. Plant Signal. Behav. 8, 1–14 (2013).
Konishi, M. & Yanagisawa, S. Roles of the transcriptional regulation mediated by the nitrate-responsive cis-element in higher plants. Biochem. Biophys. Res. Commun. 411, 708–713 (2011).
Aminedi, R. & Das, N. Class I patatin genes from potato (Solanum tuberosum L.) cultivars: molecular cloning, sequence comparison, prediction of diverse cis-regulatory motifs, and assessment of the promoter activities under field and in vitro conditions. Vitr. Cell. Dev. Biol. - Plant 50, 673–687 (2014).
Fauteux, F., Blanchette, M. & Strömvik, M. V. Seeder: Discriminative seeding DNA motif discovery. Bioinformatics 24, 2303–2307 (2008).
López, Y., Patil, A. & Nakai, K. Identification of novel motif patterns to decipher the promoter architecture of co-expressed genes in Arabidopsis thaliana . BMC Syst. Biol. 7, S10 (2013).
Zolotarov, Y. & Strömvik, M. De novo regulatory motif discovery identifies significant motifs in promoters of five classes of plant dehydrin genes. PLoS One 10, 1–19 (2015).
Li, X. Q. et al. Detection of nitrogen sufficiency in potato plants using gene expression markers. Am. J. Potato Res. 87, 50–59 (2010).
Zebarth, B. J. et al. Differential gene expression as an indicator of nitrogen sufficiency in field-grown potato plants. Plant Soil 345, 387–400 (2011).
Yang, X. S. et al. Gene expression biomarkers provide sensitive indicators of in planta nitrogen status in maize. Plant Physiol. 157, 1841–1852 (2011).
Hammond, J. P. et al. Gene expression changes in phosphorus deficient potato (Solanum tuberosum L.) leaves and the potential for diagnostic gene expression markers. PLoS One 6, e24606 (2011).
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: Discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Kim, D. et al. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 14, R36 (2013).
Luo, S. et al. Sample collection protocol effects on quantification of gene expression in potato leaf tissue. Plant Mol. Biol. Report. 29, 369–378 (2011).
de Almeida, M. R. et al. Comparative transcriptional analysis provides new insights into the molecular basis of adventitious rooting recalcitrance in Eucalyptus. Plant Sci. 239, 155–165 (2015).
Amar, D. et al. Evaluation and integration of functional annotation pipelines for newly sequenced organisms: the potato genome as a test case. BMC Plant Biol. 14, 1–14 (2014).
Reuveny, Z., Dougall, D. K. & Trinity, P. M. Regulatory coupling of nitrate and sulfate assimilation pathways in cultured tobacco cells. Proc. Natl. Acad. Sci. USA 77, 6670–6672 (1980).
Bao, S., An, L., Su, S., Zhou, Z. & Gan, Y. Expression patterns of nitrate, phosphate, and sulfate transporters in Arabidopsis roots exposed to different nutritional regimes. Botany 89, 647–653 (2011).
Zambelli, F., Pesole, G. & Pavesi, G. Using Weeder, Pscan, and PscanChIP for the Discovery of Enriched Transcription Factor Binding Site Motifs in Nucleotide Sequences. Curr. Protoc. Bioinformatics 47, 2.11.1–2.11.31 (2014).
Bailey, T. L. et al. MEME Suite: Tools for motif discovery and searching. Nucleic Acids Res. 37, 202–208 (2009).
Gordon, D. B., Nekludova, L., McCallum, S. & Fraenkel, E. TAMO: A flexible, object-oriented framework for analyzing transcriptional regulation using DNA-sequence motifs. Bioinformatics 21, 3164–3165 (2005).
Mathelier, A. et al. JASPAR 2014: An extensively expanded and updated open-access database of transcription factor binding profiles. Nucleic Acids Res. 42, 142–147 (2014).
Higo, K., Ugawa, Y., Iwamoto, M. & Korenaga, T. Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 27, 297–300 (1999).
Grierson, C. et al. Separate cis sequences and trans factors direct metabolic and developmental regulation of a potato tuber storage protein gene. Plant J. 5, 815–826 (1994).
Gilmartin, P., Niner, B. & Chua, N. A Metal-dependent DNA-binding protein interacts with a constitutive element of a light-responsive promoter. Society 2, 857–866 (1990).
Villain, P., Clabault, G., Mache, R. & Zhou, D. X. S1F Binding-site is related to but different from the light-responsive Gt-1 binding-site and differentially represses the spinach Rps1 promoter in transgenic tobacco. J. Biol. Chem. 269, 16626–16630 (1994).
Chan, C., Guo, L. & Shih, M. Promoter analysis of the nuclear gene encoding the chloroplast glyceraldehyde-3-phosphate dehydrogenase B subunit of Arabidopsis thaliana . 1, 131–141 (2001).
Zebarth, B. J., Tai, G., Tarn, R., de Jong, H. & Milburn, P. H. Nitrogen use efficiency characteristics of commercial potato cultivars. Can. J. Plant Sci. 84, 589–598 (2004).
Zebarth, B. J., Tremblay, N., Fournier, P., Leblon, B. & Rees, H. Mapping spatial variation in potato nitrogen status using the ‘N sensor’. Acta Hortic. 627, 267–273 (2003).
Tai, H. H. & Zebarth, B. J. Effect of time of day of sampling on potato foliar gene expression used to assess crop nitrogen status. Am. J. Potato Res. 92, 284–293 (2015).
Sánchez, E. et al. Proline metabolism in response to nitrogen deficiency in French Bean plants (Phaseolus vulgaris L. cv Strike). Plant Growth Regul. 36, 261–265 (2002).
Michaeli, S. & Fromm, H. Closing the loop on the GABA shunt in plants: are GABA metabolism and signaling entwined? Front. Plant Sci. 6, 1–7 (2015).
Liepman, A. H. & Olsen, L. J. Peroxisomal alanine: Glyoxylate aminotransferase (AGT1) is a photorespiratory enzyme with multiple substrates in Arabidopsis thaliana . Plant J. 25, 487–498 (2001).
Novitskaya, L., Trevanion, S. J., Driscoll, S., Foyer, C. H. & Noctor, G. How does photorespiration modulate leaf amino acid contents? A dual approach through modelling and metabolite analysis. Plant, Cell Environ. 25, 821–835 (2002).
Bloom, A. J. Photorespiration and nitrate assimilation: a major intersection between plant carbon and nitrogen. Photosynth. Res. 123, 117–128 (2015).
Shelp, B. J., Bown, A. W. & McLean, M. D. Gamma-Aminobutyric Acid. Trends Plant Sci. 4, 446–452 (1999).
Navarro, C. et al. Control of flowering and storage organ formation in potato by FLOWERING LOCUS T. Nature 478, 119–122 (2011).
Konishi, M. & Yanagisawa, S. Arabidopsis NIN-like transcription factors have a central role in nitrate signalling. Nat. Commun. 4, 1617 (2013).
Denancé, N., Szurek, B. & Noël, L. D. Emerging functions of nodulin-like proteins in non-nodulating plant species. Plant Cell Physiol. 55, 469–474 (2014).
Zebarth, B. J. & Milburn, P. H. Spatial and temporal distribution of soil inorganic nitrogen concentration in potato hills. Can. J. Soil Sci. 7, 183–195 (2003).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 7, 562–78 (2012).
Trapnell, C. et al. Differential analysis of gene regulation at transcript resolution with RNA-seq. Nat. Biotechnol. 31, 46–53 (2013).
Tai, H. H., Conn, G., Davidson, C. & Platt, H. W. (Bud). Arbitrary multi-gene reference for normalization of real-time PCR gene expression data. Plant Mol. Biol. Report 27, 315–320 (2009).
Nakane, E., Kawakita, K., Doke, N. & Yoshioka, H. Elicitation of primary and secondary metabolism during defense in the potato. J. Gen. Plant Pathol. 69, 378–384 (2003).
Mahony, S. & Benos, P. V. STAMP: A web tool for exploring DNA-binding motif similarities. Nucleic Acids Res. 35, 253–258 (2007).
Cock, P. J. A. et al. Biopython: Freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).
Crooks, G., Hon, G., Chandonia, J. & Brenner, S. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Acknowledgements
This work was supported by a grant from the Natural Sciences and Engineering Research Council of Canada (NSERC) (Grant No. 283303) to M.V.S.; A-base funding from Agriculture and Agri-Food Canada to H.H.T. and B.J.Z.; by the Mexican National Council of Science and Technology (Consejo Nacional de Ciencia y Tecnología; CONACYT) (Scholarship No. 381158) to J.H.G.; and by the McGill Department of Plant Science Graduate Excellence Fund. The authors also acknowledge Centre SÈVE for funding. The authors thank Rui Li for technical help with alternative splicing, Jeff Xia for helpful discussion on ANOVA analyses, Charlotte Davidson and Lana Nolan for technical assistance with gene expression analysis and Karen Terry for technical assistance with field experiments. Computations were made on the supercomputer Guillimin from McGill University, managed by Calcul Québec and Compute Canada. The operation of this supercomputer is funded by the Canada Foundation for Innovation (CFI), NanoQuébec, RMGA and the Fonds de recherche du Québec - Nature et technologies (FRQ-NT).
Author information
Authors and Affiliations
Contributions
J.H.G., H.H.T. and M.V.S. wrote the main manuscript text. H.H.T. and B.J.Z. designed and carried out the field experiment. J.H.G., H.H.T., M.L. and M.V.S. designed and carried out the bioinformatics and computational analyses. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Gálvez, J., Tai, H., Lagüe, M. et al. The nitrogen responsive transcriptome in potato (Solanum tuberosum L.) reveals significant gene regulatory motifs. Sci Rep 6, 26090 (2016). https://doi.org/10.1038/srep26090
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep26090
This article is cited by
-
Transcriptome analysis reveals multiple effects of nitrogen accumulation and metabolism in the roots, shoots, and leaves of potato (Solanum tuberosum L.)
BMC Plant Biology (2022)
-
Construction of drought stress regulation networks in potato based on SMRT and RNA sequencing data
BMC Plant Biology (2022)
-
The chloroplast redox-responsive transcriptome of solanaceous plants reveals significant nuclear gene regulatory motifs associated to stress acclimation
Plant Molecular Biology (2022)
-
Transcriptome analysis of potato shoots, roots and stolons under nitrogen stress
Scientific Reports (2020)
-
Structural genome analysis in cultivated potato taxa
Theoretical and Applied Genetics (2020)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.