The ABDC project: Automated Big Data Clustering
A few words about the project
We aim to classify and characterize a very large number of sequences (not vectors) without computing either a complete distance matrix or a complete global alignment, relying instead on global and local descriptors, both quantitative and qualitative. Efficient parallel revision of models and structures will be the key to the project. Classical distances should be superseded by distance vectors.
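To make the idea of a distance vector concrete, here is a minimal Python sketch. It is only an illustration, not the project's method: the three descriptors (length, GC content, number of distinct symbols), the function names and the assumption that the sequences are DNA strings are all hypothetical choices made for this example.

```python
from collections import Counter


def descriptors(seq: str) -> dict[str, float]:
    """Compute a few global descriptors of one sequence.

    The descriptor set is a hypothetical choice for illustration only.
    """
    seq = seq.upper()
    n = len(seq)
    counts = Counter(seq)
    # Share of G and C symbols; defined as 0.0 for an empty sequence.
    gc = (counts["G"] + counts["C"]) / n if n else 0.0
    return {
        "length": float(n),
        "gc_content": gc,
        "distinct_symbols": float(len(counts)),
    }


def distance_vector(a: str, b: str) -> dict[str, float]:
    """Return one distance per descriptor instead of a single scalar."""
    da, db = descriptors(a), descriptors(b)
    return {name: abs(da[name] - db[name]) for name in da}


if __name__ == "__main__":
    print(distance_vector("ACGT", "ACGTACGT"))
```

Each sequence is described on its own, in time linear in its length, so the descriptors can be computed in parallel and no alignment is needed. Comparisons are then made per descriptor and only for the pairs of interest.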
A few words on what the letters A, B, D and C really stand for can be found here (in French), along with a reference page for the corresponding links.
Some documents (with date)
April 2015
Matthieu's presentation is here and Benoit's slides are there. Gilles's file is descripteurs.pdf.
A short report in French on the meeting of April 9th is available: CR_09avril2015.pdf.
Related pages: eodd.php (the experimental database of descriptors)
Interesting books:
November 2014
Matthieu's presentation is here and Jérôme's three slides are there. Gilles's PDF is abdc_GH_november.pdf.
A short report in French on the meeting of November 6th is available: CR_06novembre2014.pdf.
Related pages: 1055genomes.php, txt2codons.php