Volume 8   Issue 1   Year 2013
Isaev E.A., Kornilov V.V.

The Problem of Processing and Storage of Large Amounts of Scientific Data and Approaches to Its Solution

Mathematical Biology & Bioinformatics. 2013;8(1):49-65.

doi: 10.17537/2013.8.49.

References

  1. Howe D, Costanzo M, Fey P, Gojobori T, Hannick L, Hide W, Hill DP, Kania R, Schaeffer M, St Pierre S, et al. Big data: the future of biocuration. Nature. 2008;455:47-50. doi: 10.1038/455047a
  2. PMC - a free full-text archive of biomedical and life sciences journal literature at the U.S. National Institutes of Health's National Library of Medicine (NIH/NLM). http://www.ncbi.nlm.nih.gov/pmc/  (accessed 10 February 2013).
  3. MIKE2.0. The open source standard for Information Management. Big Data Definition. http://mike2.openmethodology.org/wiki/Big_Data_Definition  (accessed 10 February 2013).
  4. Manyika J, Chui M, Brown B, Bughin J, Dobbs R, Roxburgh C, Byers AH. Big data: The next frontier for innovation, competition, and productivity: McKinsey Global Institute Report. 2011. http://www.mckinsey.com/insights/mgi/research/technology_and_innovation/big_data_the_next_frontier_for_innovation  (accessed 10 February 2013).
  5. Kanarakus K. Seti (Network World). 2011;04. http://www.osp.ru/nets/2011/04/13010802/  (accessed 10 February 2013) (in Russ.).
  6. Lynch C. How do your data grow? Nature. 2008;455(7209):28-29. doi: 10.1038/455028a
  7. Human Genome Project Information Website. http://www.ornl.gov/sci/techresources/Human_Genome/home.shtml  (accessed 10 February 2013).
  8. Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG, Carnevali P, Nazarenko I, Nilsen GB, Yeung G, et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010;327:78-81. doi: 10.1126/science.1181498
  9. Pell J, Hintze A, Canino-Koning R, Howe A, Tiedje JM, Brown CT. Scaling metagenome sequence assembly with probabilistic de Bruijn graphs. PNAS. 2012;109:13272-13277. doi: 10.1073/pnas.1121464109
  10. Schadt EE, Linderman MD, Sorenson J, Lee L, Nolan GP. Computational solutions to large-scale data management and analysis. Nat Rev Genet. 2010;11:647-657. doi: 10.1038/nrg2857
  11. Loh P, Baym M, Berger B. Compressive genomics. Nature Biotechnology. 2012;30:627-630. doi: 10.1038/nbt.2241
  12. E-health Standards and Interoperability: ITU-T Technology Watch Report. April 2012. http://www.itu.int/dms_pub/itu-t/oth/23/01/T23010000170001PDFE.pdf  (accessed 10 February 2013).
  13. Castro D. The Role of Information Technology in Medical Research. In: IEEE 2009: Atlanta Conference on Science, Technology and Innovation Policy (October 2009). 2009.
  14. Joint Center for Computational Biology and Bioinformatics PRC RAS. http://www.jcbi.ru/index.html  (accessed 10 February 2013).
  15. Brumfiel G. High-energy physics: Down the petabyte highway. Nature. 2011;469(7330):282-283. doi: 10.1038/469282a
  16. Essers L. Computerworld Rossiia (Computerworld Russia). 2011;18 (in Russ.).
  17. The Sloan Digital Sky Survey. http://www.sdss.org/  (accessed 10 February 2013).
  18. Data, data everywhere. A special report on managing information. The Economist. 2010.
  19. Stephens M. Petabyte-chomping big sky telescope sucks down baby code. The Register. http://www.theregister.co.uk/2010/11/26/lsst_big_data_and_agile  (accessed 10 February 2013).
  20. Boon M. Astronomical Computing. Symmetry Breaking. http://www.symmetrymagazine.org/breaking/2010/10/18/astronomical-computing  (accessed 10 February 2013).
  21. LOFAR website. http://www.lofar.org/  (accessed 10 February 2013).
  22. SKA Project website. http://www.skatelescope.org/  (accessed 10 February 2013).
  23. Pugachev VD, Isaev EA, Amzarakov MB, Samodurov VA, Sukhov RR, Kobylka NA. In: Vserossiiskaia radioastronomicheskaia konferentsiia (Russian Radio Astronomy Conference): book of abstracts. Saint Petersburg; 2011. P. 144 (in Russ.).
  24. RadioAstron Project Site. http://www.asc.rssi.ru/radioastron/rus/index.html  (accessed 10 February 2013).
  25. Indiana launches new ultra-high-speed network. University Information Technology Services. http://uitsnews.iu.edu/2012/01/31/indiana-launches-new-ultra-high-speed-network  (accessed 10 February 2013).
  26. Dubova N. At the Cutting Edge of Big Data. Otkrytye sistemy (Open Systems Journal). 2012;3 (in Russ.).
  27. The Apache Software Foundation Project. http://www.apache.org/foundation  (accessed 10 February 2013).
  28. Apache Hadoop project website. http://hadoop.apache.org  (accessed 10 February 2013).
  29. White T. Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 3rd Edition. O'Reilly Media; Yahoo Press; 2012. 688 p.
  30. Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. In: Proceedings of the Sixth Conference on Operating System Design and Implementation. Berkeley; 2004.
  31. Sadalage P, Fowler M. NoSQL Distilled. Pearson Education; 2012. 192 p.
  32. Stonebraker M, Abadi D, DeWitt DJ, Madden S, Paulson E, Pavlo A, Rasin A. MapReduce and Parallel DBMSs: Friends or Foes? Communications of the ACM. 2010;53(1). doi: 10.1145/1629175.1629197
  33. Pavlo A, Paulson E, Rasin A, Abadi DJ, DeWitt DJ, Madden SR, Stonebraker M. A comparison of approaches to large-scale data analysis. In: Proceedings of the 35th SIGMOD International Conference on Management of Data. New York: ACM Press; 2009. P. 165-178.
  34. Chernyak L. Platforms for Big Data. Otkrytye sistemy (Open Systems Journal). 2012;7 (in Russ.).
  35. Artemov S. PC Week/RE. http://www.pcweek.ru/upload/iblock/d05/jet-big-data.pdf  (accessed 10 February 2013) (in Russ.).
  36. Announcing the New SGI UV: The Big Brain Computer. Business Wire. http://www.businesswire.com/news/home/20120618005340/en  (accessed 10 February 2013).
  37. Vykhodcev A. A Platform for Big Data. Otkrytye sistemy. 2012;6 (in Russ.).
  38. Yakhina I. A Warehouse for Big Data. Otkrytye sistemy. 2012;7 (in Russ.).
  39. Serov D. The «sar» for analysts. Otkrytye sistemy. 2011;4 (in Russ.).
  40. Chernyak L. Computerworld Rossiia (Computerworld Russia). 2011;14 (in Russ.).
  41. EGEE-RDIG Project. http://www.egee-rdig.ru  (accessed 10 February 2013).
  42. TOP500 List of the world’s top supercomputers. November 2012. http://www.top500.org/lists/2012/11/  (accessed 10 February 2013).
  43. http://www.olcf.ornl.gov/titan/  (accessed 10 February 2013).
  44. http://parallel.ru/cluster/lomonosov.html  (accessed 10 February 2013).
  45. The OpenNet Project. http://www.opennet.ru/opennews/art.shtml?num=35358  (accessed 10 February 2013).
  46. The Graph 500 List. http://www.graph500.org/  (accessed 10 February 2013).
  47. Linux Cluster, PRC RAS. http://www.jcbi.ru/EN/klaster/index.shtml  (accessed 10 February 2013).
  48. Lakhno VD, Isaev EA, Pugachev VD, Zaitsev AYu, Fialko NS, Rykunov SD, Ustinin MN. Development of Information and Communication Technologies in Pushchino Research Center of the Russian Academy of Sciences. Mathematical Biology and Bioinformatics. 2012;7(2):529-544 (in Russ.). doi: 10.17537/2012.7.529
  49. Shatskaya MV, Andrianov AA, Girin IA, Isaev EA, Kostenko VI, Likhachev SF, Pimakov AS, Seliverstov SI, Fedorov NA. Organization of Scientific Data Processing Center for Radio Interferometric Projects. Cosmic Research. 2012;50(4):324. doi: 10.1134/S0010952512040065
published in Russian


 
