Jump to content

Talk:Multiple sequence alignment

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
Former good articleMultiple sequence alignment was one of the Natural sciences good articles, but it has been removed from the list. There are suggestions below for improving the article to meet the good article criteria. Once these issues have been addressed, the article can be renominated. Editors may also seek a reassessment of the decision if they believe there was a mistake.
Article milestones
DateProcessResult
September 12, 2012Good article nomineeListed
March 7, 2010Good article reassessmentKept
April 11, 2024Good article reassessmentDelisted
Current status: Delisted good article

some clarifications

[edit]

the statement

Because HMMs are probabilistic, they do not produce the same solution every time they are run on the same dataset; thus they cannot be guaranteed to converge to an optimal alignment. HMMs can produce both global and local alignments. Although HMM-based methods have been developed relatively recently, they offer significant improvements in computational speed, especially for sequences that contain overlapping regions.

is incorrect. HMMs are probablistic in the sense that they are a statistical model, however, they are completely deterministic and will produce the same result every time on a given dataset. HMM alignments use the same algorithms as local sequence alignments and therefore have no computational speed advantage.

One of the most common motif-finding tools, known as MEME, uses expectation maximization and hidden Markov methods to generate motifs that are then used as search tools by its companion MAST in the combined suite MEME/MAST.[19][20]

MEME uses a PSSM (position specific scoring matrix), but does not contain insertion or deletion probabilities or other characteristics of a typical sequence HMM.

Gribskov 03:55, 20 September 2007 (UTC)[reply]

Given the specific technological and algorithmic and biological significance of short-read sequence alignment, I think this topic deserves its own page. For example, the differences between short read mapping and de-novo assembly in next-generation sequencing projects could be discussed on such a page. --Dan|(talk) 14:01, 22 January 2009 (UTC)[reply]

Alternative interpretations of MSAs

[edit]

The main use/interpretation of columns in MSAs is that residues in the same column are "related" by either point substitutions or no substitutions at all.

However, there are applications of MSAs where residues in the same column are assumed to be "structurally" equivalent but not necessarily evolutionarily equivalent e.g. http://www.ncbi.nlm.nih.gov/pubmed/16733545 - indeed in some of these applications the aim is to avoid including "homologous" sequences in the alignment e.g. http://www.ncbi.nlm.nih.gov/pubmed/9920390

At the moment this distinction isn't made on the MSA wikipedia page - although the top of the sequence alignment wikipedia page does highlight different interpretations.

First wikipedia post ever here - not quite ready to be bold yet! - so wanted to ask/check whether anyone disagrees with introducing some changes to reflect this distinction to the MSA page? SiggyDood (talk) 12:45, 11 March 2009 (UTC)[reply]

GA Reassessment

[edit]
This discussion is transcluded from Talk:Multiple sequence alignment/GA1. The edit link for this section can be used to add comments to the reassessment.

Starting GA reassessment as part of the GA Sweeps process. Jezhotwells (talk) 19:18, 28 February 2010 (UTC)[reply]

Checking against GA criteria

[edit]
GA review (see here for criteria)
  1. It is reasonably well written.
    a (prose): b (MoS):
    This article appears more like an essay or a paper than an encyclopaedia artcile. Consider a thorough copy-edit for style and clarity.  Done
  2. It is factually accurate and verifiable.
    a (references): b (citations to reliable sources): c (OR):
    I repaired dead links using WP:CHECKLINKS. All references appear to be OK  Done
    There are many uncited paragraphs.  Done
  3. It is broad in its coverage.
    a (major aspects): b (focused):
  4. It follows the neutral point of view policy.
    Fair representation without bias:
  5. It is stable.
    No edit wars, etc.:
  6. It is illustrated by images, where possible and appropriate.
    a (images are tagged and non-free images have fair use rationales): b (appropriate use with suitable captions):
    Images such as File:Caspase-motif-alignment.png and File:RPLP0 90 ClustalW aln.gif are illegible in the article and appear to add little.  Done
  7. Overall:
    Pass/Fail:
    Main concerns: the style of the article is un-encyclopaedic, images add little, many uncited paragraphs. On hold until 7 March, major contributors and projects will be notified. Jezhotwells (talk) 19:29, 28 February 2010 (UTC)[reply]
    OK, thanks for for fixing things up, keep GA status. Jezhotwells (talk) 18:49, 7 March 2010 (UTC)[reply]

Note

[edit]

Most of the unencylopedic style was inserted by a well-meaning but seemingly novice editor. I've removed much of that content because it was redundant with other, cited parts of the article. I've also significantly enlarged the lead image. Without such detail, the image is useless to the reader unless he or she clicks through to the larger media file -- which is unlikely. All paragraphs now have at least one relevant reference. If are any issues that remain to be addressed for the purposes of this reassessment, please let me know. Thanks, Emw (talk) 08:10, 6 March 2010 (UTC)[reply]

Hi, making the image larger doesn't really address the problem which is that the image does not convey any information. Please see WP:MOS#Avoid entering textual information as images and WP:MOS#Images which suggest no larger than 300px for lead images. Jezhotwells (talk) 14:51, 6 March 2010 (UTC)[reply]
Images portraying multiple sequence alignments seem like a valid exception to the guideline discouraging the use of textual information in images. Presumably that guideline pertains to uses of natural language in images. In contrast, the text used in the two images in this article represent sequences of amino acids. It is the convention used to represent MSAs among reliable sources (i.e., textbooks, journal articles, reliable websites). Also, I think the lead image's detail necessitates its larger-than-usual dimensions. Such exceptions are provided for in the MOS: "Images containing important detail (for example, a map, diagram, or chart), and which may need larger sizes than usual." Emw (talk) 15:23, 6 March 2010 (UTC)[reply]
OK, I'll buy that, Another possibility, which I ask tou to consider is moving the image elsewhere in teh artcile so that it doesn't sandwich the lead. Jezhotwells (talk) 18:49, 7 March 2010 (UTC)[reply]
[edit]

Hello fellow Wikipedians,

I have just modified 4 external links on Multiple sequence alignment. Please take a moment to review my edit. If you have any questions, or need the bot to ignore the links, or the page altogether, please visit this simple FaQ for additional information. I made the following changes:

When you have finished reviewing my changes, you may follow the instructions on the template below to fix any issues with the URLs.

This message was posted before February 2018. After February 2018, "External links modified" talk page sections are no longer generated or monitored by InternetArchiveBot. No special action is required regarding these talk page notices, other than regular verification using the archive tool instructions below. Editors have permission to delete these "External links modified" talk page sections if they want to de-clutter talk pages, but see the RfC before doing mass systematic removals. This message is updated dynamically through the template {{source check}} (last update: 5 June 2024).

  • If you have discovered URLs which were erroneously considered dead by the bot, you can report them with this tool.
  • If you found an error with any archives or the URLs themselves, you can fix them with this tool.

Cheers.—InternetArchiveBot (Report bug) 04:37, 8 February 2018 (UTC)[reply]

GA Reassessment

[edit]

The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.


Article (edit | visual edit | history) · Article talk (edit | history) · WatchWatch article reassessment pageMost recent review
Result: Delisted. Hog Farm Talk 16:58, 11 April 2024 (UTC)[reply]

In addition to substantial uncited text, the lead of the article is a bit too difficult and the body contains large numbers of external links. —Femke 🐦 (talk) 17:12, 29 March 2024 (UTC)[reply]

A request to make the lead image smaller from GA1 doesn't seem to have happened. It is still too big for MOS:IMGSIZE. As well as the unsourced sections and inappropriate extlinks there is a lot of material that appears to be primary-sourced and promotionally worded about individual research projects or implementations, rather than being based on published works by disinterested parties surveying and reviewing the methods that are available, I think maybe problematic with respect to WP:GACR#3b (going into excessive detail). —David Eppstein (talk) 02:07, 1 April 2024 (UTC)[reply]
I'd like to make time to go over this article this weekend, and I'd appreciate the reassessment remaining open for just a few extra days. Thanks ― Synpath 04:57, 5 April 2024 (UTC)[reply]
I've gone through and removed external links in the body of the article and adjusted the lead, but that only amounts to cosmetic changes to the article. I can see now that handling the citations and removing the conversational tone of the article is more editing than I'm willing to spend time on. Thanks for keeping the discussion open. ― Synpath 06:01, 8 April 2024 (UTC)[reply]
The discussion above is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.