The goal of this research was to develop a new genetic database of simple sequence repetition (SSR) primers for faba and classify
them according to their target genes and respective biological processes. Approximately 75,605 and 148,196 previously published genomic and transcriptomic faba sequences, respectively, have been used to detect possible SSRs. The numbers of
identified SSRs from each dataset were 25,502 and 12,319, respectively. The distribution of different repeat classes indicated
that trinucleotides represent the largest number of repeat counts, followed by dinucleotides. The extracted genic SSR sequences
were used to design 1091 polymerase chain reaction (PCR) primers, of which only 238 (21.8%) primers target genomic
sequences and the other 853 PCR primers targeted transcriptomic sequences. The annotation of gene-targeted SSRs showed
that approximately 897 genes were targeted by our SSR primers. Approximately 1890 gene ontology (GO) identification codes
have been obtained. The GO keywords were distributed among distinct molecular cell features. The highest redundancies
involved 554 technical words, 196 domains, and 160 molecular feature phrases. These GO codes belonged to the general level
of GO and included molecular function, cellular component, and biological process (544, 670, and 676 GOs, respectively).
Twenty-seven SSR PCR primers were synthesized to 12 Egyptian faba bean genotypes. Approximately 11 SSR provided one to
two PCR bands, whereas other SSRs provided only one sharp band with polymorphic band size. There were 13 polymorphic
primers. The polymorphism information content was 0.3, which implied moderate informativeness. |