Fundamental introduction to regular expression (regex or regexp for short) in R. Repost by indicating the source please!
One week ago, I encoutered a problem when I was learning The Art of R Programming (page 35), I would like to use pattern match, that is regex, however, something unusual happened. I sought for help on stackoverflow
, you could find the question here (unfortunately duplicated). The answer by akrun invokes my interest on regex!
Regex is kind of independent but important concept that is almost implemented by every programming language. Although this post is titled with “01”, I suggest, you could come back learning this when you come across some concrete problems related, that would make you understand more clearly!
本讲主要介绍正则表达式(regular expression, regex)的概念、主要语法风格以及在R中的使用方法。欢迎转载,但请注明出处!
一周前,在学习The Art of R Programming第35页的时候遇到一个练习,我想用模式匹配去解决,也就是regex,但遇到了未曾料到的问题,于是我在stackoverflow上提问了,大牛的解答让我想要集中几天精力搞清楚regex到底怎么用,问题见此:my question,不巧被标记了重复…
regex是一个相对独立的内容,几乎所有编程语言都实现了这种功能,这也证明了它的重要性,另外,本讲标题序号虽然是01,但其实大家可以放在遇到实际问题时再来学,那样有了共鸣,印象更深刻!
I would like to post a series recording my way of learning R language, with the name “leaRning”! Repost by indicating the source please!
Because I swithed my major to bioinformatics in which data manipulation is the core, on the other hand, the R programming language is irreplaceable now for data science, so learning R is a necessity and must! What’s more, it would be much easier learning other programming languages after you conquer one.
我准备更新一个记录自己学习R语言的系列,暂时称其为学R学!一来对自己的知识作一巩固和总结,二来若能点滴帮助想学R的朋友,实属荣幸!欢迎转载,但请注明出处!
因为我要转行做生物信息,生物信息的核心之一就是处理生物数据,而R语言是专为数据处理领域设计的编程语言,所以我要学,也不得不学!做数据,R语言是不二选择,其他的语言,等学好R后,触类旁通,不会太难。
This is a brief tutorial about methods of downloading sra, sam and fastq files, mainly focusing on Aspera Connect. Repost by indicating the source please!
SRA: Sequence Read Archive: It belongs to NCBI (National Center for Biotechnology Information), is a database storing high throughput sequencing (HTS) raw data, alignment information and metadata. Almost all HTS data in published publications will be asked uploading to here, and stored as .sra compressed file format.
这是个简短的教程,目的是介绍几种比较方便快捷的下载SRA、SAM及Fastq文件的方法。欢迎转载,但请注明出处!
SRA数据库: Sequence Read Archive:隶属NCBI (National Center for Biotechnology Information),它是一个保存高通量测序原始数据以及比对信息和元数据 (metadata) 的数据库,所有已发表的文献中高通量测序数据基本都上传至此,方便其他研究者下载及再研究。其中的数据则是通过压缩后以.sra文件格式来保存的。