sci-biology/sim4 (gentoo)

Package Information

Description:
sim4 is a similarity-based tool for aligning an expressed DNA sequence (EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects end matches when the two input sequences overlap at one end (i.e., the start of one sequence overlaps the end of the other). sim4 employs a BLAST-based technique to first determine the basic matching blocks representing the "exon cores". In this first stage, it detects all possible exact matches of W-mers (i.e., DNA words of size W) between the two sequences and extends them to maximal-scoring gap-free segments. In the second stage, the exon cores are extended into the adjacent as-yet-unmatched fragments using greedy alignment algorithms, and heuristics are used to favor configurations that conform to the splice-site recognition signals (GT-AG, CT-AC). If necessary, the process is repeated with less stringent parameters on the unmatched fragments.
Homepage:
http://globin.cse.psu.edu/html/docs/sim4.html
License:
GPL-2
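
The first stage in the description above (exact W-mer matches extended into gap-free segments) can be illustrated with a minimal Python sketch. This is not sim4's actual code: the function names `find_wmer_seeds` and `extend_gap_free`, the scoring values (match = 1, mismatch = -5), and the toy sequences are all assumptions chosen for illustration. A BLAST-style implementation would typically also stop an extension early once the score drops too far below its best value, which this sketch omits.

```python
from collections import defaultdict


def find_wmer_seeds(est, genomic, w):
    """Return all (i, j) with est[i:i+w] == genomic[j:j+w] (exact W-mer matches)."""
    # Index every W-mer of the genomic sequence by its start position.
    index = defaultdict(list)
    for j in range(len(genomic) - w + 1):
        index[genomic[j:j + w]].append(j)
    # Look up every W-mer of the expressed sequence in that index.
    seeds = []
    for i in range(len(est) - w + 1):
        for j in index.get(est[i:i + w], ()):
            seeds.append((i, j))
    return seeds


def extend_gap_free(est, genomic, i, j, w, match=1, mismatch=-5):
    """Extend the seed at (i, j) along its diagonal, without gaps, in both
    directions, keeping the best-scoring extent on each side.

    Scoring values are illustrative assumptions, not sim4's parameters.
    Returns (est_start, genomic_start, length, score).
    """
    best = score = w * match
    # Extend to the right of the seed, remembering where the score peaked.
    best_right = 0
    k = 0
    while i + w + k < len(est) and j + w + k < len(genomic):
        score += match if est[i + w + k] == genomic[j + w + k] else mismatch
        k += 1
        if score > best:
            best, best_right = score, k
    # Extend to the left, starting from the best score reached so far.
    score = best
    best_left = 0
    k = 0
    while i - k - 1 >= 0 and j - k - 1 >= 0:
        score += match if est[i - k - 1] == genomic[j - k - 1] else mismatch
        k += 1
        if score > best:
            best, best_left = score, k
    return (i - best_left, j - best_left, w + best_left + best_right, best)


# Toy sequences (hypothetical): the expressed sequence occurs at offset 3.
est = "ACGGTCAT"
genomic = "TTTACGGTCATGG"
seeds = find_wmer_seeds(est, genomic, 4)
print(seeds)                                     # [(0, 3), (1, 4), (2, 5), (3, 6), (4, 7)]
print(extend_gap_free(est, genomic, 2, 5, 4))    # (0, 3, 8, 8)
```

In this toy case all five seeds lie on the same diagonal and extend to the same 8-base segment, which is the kind of block the description calls an "exon core".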

Versions

Version       EAPI  Keywords           Slot
20030921-r2   7     ~amd64 ~ppc ~x86   0

Metadata

Raw Metadata XML
<pkgmetadata>
	<maintainer type="project">
		<email>sci-biology@gentoo.org</email>
		<name>Gentoo Biology Project</name>
	</maintainer>
	<longdescription>
		sim4 is a similarity-based tool for aligning an expressed DNA sequence
		(EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects
		end matches when the two input sequences overlap at one end (i.e., the
		start of one sequence overlaps the end of the other). sim4 employs a
		BLAST-based technique to first determine the basic matching blocks
		representing the "exon cores". In this first stage, it detects all
		possible exact matches of W-mers (i.e., DNA words of size W) between
		the two sequences and extends them to maximal scoring gap-free
		segments. In the second stage, the exon cores are extended into the
		adjacent as-yet-unmatched fragments using greedy alignment algorithms,
		and heuristics are used to favor configurations that conform to the
		splice-site recognition signals (GT-AG, CT-AC). If necessary, the
		process is repeated with less stringent parameters on the unmatched
		fragments.
	</longdescription>
</pkgmetadata>

Lint Warnings

Files

Manifest

Type File Size Versions
Unmatched Entries
Type  File                  Size
DIST  sim4-20030921.tar.gz  60814 bytes