Identification of Superior Faba bean (Vicia faba L.) Accessions for Key Morphological and Yield-related Traits

Y
Yash Kumar Singh1,*
P
Parshuram Sial2
M
Manoj Kumar3
T
Thamaraikannan Sivakumar4
Y
Yogesh Kumar1
A
Ananya Singh5
V
V.P. Sahi6
1Department of Horticulture, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
2Regional Research and Technology Transfer Station, Odisha university of Agriculture and Technology, Semiliguda, Koraput-763 002, Odisha, India.
3Department of Plant Breeding and Genetics, Bihar Agricultural University Sabour-813 210, Bihar, India.
4Division of Genomic Resources, National Bureau of Plant Genetic Resources, New Delhi-110 012, India.
5Department of Plant Pathology, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
6Department of Genetics and Plant Breeding, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
  • Submitted18-09-2025|

  • Accepted25-12-2025|

  • First Online 30-12-2025|

  • doi 10.18805/LR-5573

Background: Faba bean (Vicia faba L.) is a multipurpose legume whose genetic improvement relies on exploiting existing morphological variation. Rigorous phenotypic evaluation provides insights into heritability, trait associations and divergence patterns, which are crucial for identifying superior genotypes and its selections in breeding programs.

Methods: Forty-five genotypes, including three checks, were evaluated across two Rabi seasons in a randomized block design for 20 agro-morphological traits spanning phenology, plant architecture, pod/seed attributes and yield. Best Linear Unbiased Estimates (BLUEs) were computed across replications and seasons.Broad-sense heritability. Genotypic variation, genetic divergence, D2 statistics, trait associations and the Multi-trait Genotype-Ideotype Distance Index (MGIDI) were applied to identify superior accessions.

Result: Analysis of variance showed significant genotypic variation for most of the traits, with high heritability. However, leaflet width and seeds per pod had moderate heritability. Mahalanobis D2 divided genotypes into four clusters. Cluster IV displayed strong yield traits and significant differences from Cluster I, indicating potential for hybrid vigour. Correlation analysis revealed positive relationships, such as maturity with pods per plant and pod length, along with some trade-offs. MGIDI found nine promising accessions that combine earliness, good pod and seed traits and appealing structure. Overall, these results confirm a wide range of phenotypic diversity and provide a clear set of accession for improving faba bean.

Faba bean (Vicia faba L.) is an annual legume belonging to the family Fabaceae. It is widely known by several names such as broad bean, horse bean, windsor bean, tick bean and bakela (Hawtin and Hebblethwaite, 1983). Global production of faba bean continues to rise, with production of approximately 6.77 million tonnes of dry beans and 1.81 million tonnes of green beans in 2022, cultivated over ~6.63 million and 0.64 million acres, respectively (FAOSTAT, 2024). Recent market forecasts further emphasize the growing demand. IMARC Group (2025) reported that global fava bean production reached 9.3 million tonnes in 2024 and it is projected to expand to 12.9 million tonnes by 2033, growing at a compound annual growth rate of 3.5%.
       
Although it is grown in limited areas in India and considered a minor crop, it holds promise due to its nutritional value, adaptability and potential role in crop diversification (Singh et al., 2012a; Singh et al., 2013). Faba bean is highly valued as a source of plant-based protein, with seeds containing 22-35% protein, 58% carbohydrates and 25% dietary fibre (USDA, 2021; Martineau-Côté et al., 2022). In addition to its nutritional role, the crop provides several bioactive compounds, including flavonoids, phenolics and L-DOPA (a precursor of dopamine), which have potential medicinal applications in the treatment of Parkinson’s disease (Vered et al., 1997). The crop also serves as an important source of food and feed in developing countries, while in developed nations it is mainly utilized as animal feed (Gasim and Link, 2007; Singh and Bhatt, 2012a).
       
Globally, more than 30,000 germplasm of faba bean are conserved in gene banks (Singh and Bhatt, 2012b). This genetic variability presents significant opportunities for crop improvement, but effective utilization requires a systematic characterization and evaluation. Morphological characterization remains the first and most practical step in germplasm evaluation, providing information on the extent of variation for traits of agronomic and breeding importance (Salem, 2009; Hiywotu et al., 2022). Grouping accessions and identifying important genotypes for trait-specific breeding are made easier by multivariate techniques like principal component analysis and cluster analysis Zayed et al. (2022) for instance, used PCA to identify the characteristics that contributed to significant phenotypic divergence in faba bean accession and to direct parent selection.
Plant material and field conditions
 
A total of 42 diverse faba bean accessions, along with three checks (HFB 1, HFB 2, Vikrant), were obtained from ICAR-NBPGR, New Delhi and evaluated for morphological traits. The trial was conducted in an RBD at the Horticultural Research Farm, SHUATS, Prayagraj, during rabi 2022-23 and 2023-24. Each accession was grown in six-row plots (1.5 × 2.0 m²), with five tagged plants per replication for data recording.
 
Data collection
 
In the present study, a set of 20 phenotypic traits was evaluated for faba bean to understand their breeding importance. All the traits have been characterized according to the DUS standard. These traits included: days of germination (DG), days to first flowering (DFF), days to 50% flowering (DF), number of branches per plant (NBP), number of nodes on the main branch (NNMB), number of flowers per plant (NFP, number), plant height (PH), leaflet length (LL), leaflet width (LW), number of pods per plant (NPP), pod length (PL), pod width (PW), number of seeds per pod (NSP), biological yield (BY), economical yield (EY), harvest index (HI), days to first pod formation (DPF), seed index (SI), days to maturity (DM) and yield per hectare (YPH).
 
Statistical analysis
 
For each trait recorded, data were analyzed for the Analysis of Variance (ANOVA) along with the significance levels of *p<0.05 (significant), **p<0.01 (highly significant) and ***p<0.001 (extremely significant) and Broad sense heritability using the agricolae package of R. Each replication in the two seasons comprised data collected from five plants per replication and the Best Linear Unbiased Estimates (BLUEs) values were computed using the lme4 package in R (Bates, 2010) for each genotype across the three replications. This approach minimized the environmental variation and provided unbiased estimates of each accession.
       
Based on the BLUE values estimated, summary statistics have been calculated using the agricolae package in R to verify the central tendency and trait performance across the accessions. Furthermore, the distribution and trends of each trait among the accessions were visualized using histograms overlaid with frequency curves, generated with the ggplot2 package in R (Wickham, 2011).
 
D2 clustering analysis
 
A D2 analysis was performed to group the accessions into various clusters based on their phenotypic BLUE values for all traits, enabling us to identify accessions with identical or distinct phenotypic characteristics. This clustering enables us to understand the nature of variation within and between accessions, which is essential in the breeding program for selecting parents. Phenotypic divergence of accessions was measured using a generalized distance (D2) proposed by Mahalanobis (1936), a multivariate distance that considers correlations among traits and accurately differentiates among accessions. Furthermore, intra and inter-cluster distances were determined to assess the relationships of clusters and the extent of dissimilarity among them to identify highly dissimilar accessions to hybridize and succeed in improving faba bean traits.
 
Correlation analysis
 
Correlation analysis determines the level of association between the traits under study, thereby establishing key conclusions. The 20 morphological traits were compared in pairs to produce the estimated pairwise correlations using the corr_plot function of the metan package in R (Olivoto and Lúcio, 2021). The analysis was performed to determine the strength and relationship of association between traits, to identify interdependent traits that may serve as indirect selection criteria during the breeding process. The significance of the correlations was also tested, with correlations referred to as significant (*p<0.05), highly significant (**p<0.01) and extremely significant (***p<0.001).
 
Selection of the best accessions
 
The Multi-Trait Genotype-Ideotype Distance Index (MGIDI) was calculated according to the procedure outlined by (Olivoto et al., 2022). The pre-determined desired ideotype trait performance used the BLUE values of each accession. Factor analysis was conducted to reduce multicollinearity and generalize trait variation and the genotype scores were projected in the reduced factorial space. The MGIDI of each accession was computed as the eclidean distance between the genotype and the ideotype, with lower values denoting a closer resemblance to the ideotype. All calculations were performed in R version 4.1.2 (R Core Team, 2020), utilizing the metan package (Olivoto and Lúcio, 2021) and visualized with ggplot2 (Wickham, 2011).
Analysis of variance and heritability
 
Analysis of variance revealed significant differences among accessions for most of the traits (DG, DFF, D50F, NBP, NNMB, NFP, PH, LL, NPP, PL, PW, BY, EY, HI, DPF, SI, DM, YPH). This confirms substantial genetic variation. LW and NSP showed moderate variation (P<0.05). Replication effects were non-significant (Table 1), indicating that differences were mostly due to genotypes.

Table 1: Anova table.


       
Reproductive traits (DFF, D50F, NFP, NPP) and morphological traits (PH, LL, PL, PW, HI) showed strong effects from genotypes, making them useful for selection. Broad-sense heritability was high (>60%) for most of the traits. LW had a moderate value of 38.1%, while NSP was also moderate; PW had the highest value at 98.2%. No traits fell in the low heritability range.
       
The frequency distribution (Fig 1) showed that most accessions yielded 32-36 g/plant, with only four exceeding 37 g/plant. Seventeen accessions flowered at 60-62 days, with four flowering earlier. Nine accessions produced the highest number of flowers (25-28) and seven matured early (~110 days), while most matured around 120 days.

Fig 1: Frequency histogram for the plant morphological traits for the faba bean accessions.


 
D2 clustering and cluster trait variation
 
D² cluster analysis grouped the 45 accessions into four clusters based on 20 traits (Fig 2). Cluster I included 13 accessions, such as IVT HB18-10, IVT HB19-10 and the check Vikrant. Cluster II had 11 accessions, including IVT HB19-12, AVT I HB16-15 and the check HFB-2 (Table 2). Both Cluster I and Cluster II showed lower average values for yield traits. Cluster III had 9 accessions, while Cluster IV contained 11 accessions. Clusters III and IV recorded higher averages, with Cluster IV surpassing the others in biomass yield, grain yield, flowering, maturity and harvest index. This highlights its potential for high yield (Fig 3).

Fig 2: D2 clustering of the faba bean accessions based on phenotypic traits.



Fig 3: Cluster wise variation of the traits in each cluster.



Table 2: List of genotypes under D2 clusters.


       
The variation within clusters was highest in Cluster IV (7.68) and Cluster II (7.66) and lowest in Cluster III (4.23). The largest differences between clusters happened between Cluster I and Cluster IV (7.68) and between Cluster I and Cluster III (6.91). The smallest difference was between Cluster II and Cluster I (4.3) (Fig 4). These results show clear differences in traits, with Cluster III being the smallest and Cluster IV exhibiting the most tremendous internal variation.

Fig 4: Intra and iter cluster dstance matrix.


 
Correlation
 
Correlation analysis of morphological characteristics in faba bean demonstrated that morphological characteristics were differently associated with flowering, pod and yield characteristics, as illustrated in (Fig 5). The days to first flowering had a strong positive and significant correlation with days to maturity (r = 0.45), but a negative and non-significant correlation with the pod weight (r = -0.45). Days to fifty percent flowering is positively correlated and highly significant with the biological yield (r= 0.99) and the economic yield (r = 0.66). There were positive and significant relationships between days to maturity and the number of pods per plant (r = 0.87), pod length (r = 0.91) and number of Seeds per pod (r = 0.59).

Fig 5: Correlation scatterplot for the faba bean morphological traits.


       
Pod length is negatively correlated with pod width (r = -0.02) and highly significantly correlated positively with the number of seeds per pod (r = 0.63) and seed index (r = 0.86). Pod width is negatively correlated with the number of pods per plant (r=-0.04). The number of seeds per pod is highly significant and positively correlated with the seed index (r = 0.86), as well as showing a positive correlation with both biological yield (r = 0.29) and economic yield (r = 0.15). Biological yield is positively correlated with yield per hectare (r = 0.07). Biological yield is positively correlated with the number of branches per plant (r=0.24), number of flowers per plant (r=0.15), leaf length (r = 0.18), leaf width (r=0.14) and number of pods per plant (r=0.16).
 
Selection of superior faba bean accessions
 
MGIDI analysis identified nine top accessions (GP RFBGP-42, GP RFBGP-79, HFB-1, RFB-48, GP FLRP-20, GP FLRP-31, GP RFBGP-44, GP FLRP-29, GP FLRP-23) that closely matched the ideal profile. These accessions combine early maturity, favorable pod and seed traits and strong plant structure (Fig 6). They are promising choices for yield stability and genetic improvement in breeding programs.

Fig 6: MGIDI analysis for the selection of superior faba bean accessions.


       
Principal component analysis (PCA) found that the first five principal components (PCs) explained 79.2% of the total variation, with PC1 contributing 44.3% (Table 3). Factor analysis (mean communality = 0.79; grouped traits into five factors: FA1 (DG, PL, NBP, NFP, NPP, SI) related to yield potential; FA2 (D50F, BY); FA3 (HI, EY); FA4 (NNMB, PW); and FA5 (LW, PH) (Table 4 and Fig 7).

Table 3: Principal component analysis (PCA) for MGIDI.



Table 4: Factor analysis (FA)-Factor loadings after rotation.



Fig 7: Contribution of traits to MGIDI across five factors in superior genotypes selection.


       
Selection differential analysis showed positive genetic gains in NBP, NFP, NPP, PL, SI, YPH, HI, PH and LW, which support breeding goals. In contrast, traits like DFF, DPF, DM, D50F and BY displayed negative differentials. The MGIDI rankings (Table 5) incorporated these patterns to prioritize accessions with the best trait combinations.

Table 5: Selection differential (SD) for MGIDI analysis.


 
Statistical significance
 
Faba bean accessions exhibited highly significant genotypic variations for most of the traits, highlighting the presence of substantial genetic variability within the studied panel-a fundamental requirement for selection and crop improvement. Similar findings were previously reported by Toker (2004), who consistently observed high heritability values for yield-related traits in diverse faba bean germplasm. The predominance of high heritability estimates (>60%) in this study suggests that many traits are primarily governed by additive genetic factors, making them suitable for direct phenotypic selection. Comparable results were also demonstrated by Hamza and Hussein (2024), who identified high heritability for plant height, pods per plant and seed weight in Sudanese faba bean landraces.
       
By contrast, traits with moderate heritability, such as leaflet width and number of seeds per pod, appear to be more environmentally influenced, necessitating evaluation across multiple environments and seasons. This aligns with the findings of Hiywotu et al., (2022), who reported that genotype × environment interactions play a critical role in shaping trait heritability and selection efficiency in Ethiopian faba bean accessions.

Clustering signifies the phenotypic trait variation
 
The clustering by Mahalanobis D2 revealed distinct groups of accessions, reflecting strong phenotypic influence that can be strategically exploited in breeding. Accessions from clusters separated by large inter-cluster distances are particularly valuable for crossing because they maximize the potential for heterosis and recovery of transgressive segregants. At the same time, clusters with higher intra-cluster variability provide opportunities for effective within-cluster selection. Comparable findings were reported by Dewangan et al., (2022), who observed clear cluster differentiation among faba bean lines, with distant clusters offering the best parental combinations for yield improvement. Similarly, Upadhyay and Pandey (2025) demonstrated that clusters with superior mean performance for seed yield and related traits are prime candidates for hybridization, thereby broadening the genetic base and enhancing productivity.

Correlation across the morphological traits
 
The patterns of correlation between flowering, maturity, pod characteristics and yield highlight the intricate relationship between faba bean productivity. In line with Mishra et al., (2023), who found that harvest index and biological yield had a significant direct impact on seed yield, positive correlations between reproductive timing and yield components imply that enhancing one can improve others. However, the existence of both positive and negative correlations highlights the importance of careful selection in balancing trade-offs. Hiywotu et al., (2023) reported similar results, showing that traits related to pods and seeds had a significant impact on grain yield through direct and indirect effects, indicating their usefulness as selection criteria.
 
Superior faba bean accessions selection
 
Using the MGIDI framework enabled simultaneous, balanced selection across phenology, yield and architecture while controlling for multicollinearity via factor analysis; by ranking genotypes on their distance to an agronomic ideotype, MGIDI prioritizes accessions that deliver multi-trait gains without over-weighting redundant, correlated traits (Olivoto and Lúcio, 2021; Olivoto et al., 2022). The analysis captured by the leading principal components together with high trait communalities supports the reliability of the multivariate signal underlying the MGIDI ranking; comparable patterns-where the first PCs explain most agronomic variation and clearly separate superior, often earlier and higher-yielding faba bean genotypes-are reported in recent faba bean multivariate studies (Soliman et al., 2024). In this context, positive selection differentials for pod-and seed-related traits alongside negative differentials for flowering and maturity are consistent with ideotype-based improvement that jointly targets sink capacity and earliness, as advocated in faba bean PCA-guided selection and in cross-crop MGIDI applications (Olivoto and Lúcio, 2021; Olivoto et al., 2022).
This study revealed substantial genetic variability among 45 faba bean genotypes for key morphological and yield-related traits, with high heritability supporting the effectiveness of direct phenotypic selection. Cluster and divergence analyses identified Cluster IV as a valuable source of superior genotypes. They highlighted inter-cluster combinations, particularly between Clusters I and IV, as promising for heterosis and broadening the genetic base. Correlation analysis confirmed pods per plant, pod length and seed index as reliable indirect selection traits, while weaker associations emphasized the importance of managing trade-offs in breeding. Multivariate approaches, including PCA and MGIDI, effectively integrated trait information and prioritized ideotype-proximal genotypes such as GP RFBGP-42, GP RFBGP-79 and HFB-1 RFB-48 for advancement. Collectively, these findings provide a clear framework for selecting parents and advancing promising lines.
The authors express their gratitude to, Department of Horticulture, SHUATS, Prayagraj, India, for the experimental field and academic support.
The authors declare that there are no conflicts of interest regarding the publication of this article. No funding or sponsorship influenced the design of the study, data collection, analysis, decision to publish, or preparation of the manuscript.

  1. Bates, D. (2010). Mixed models in R using the ime4 package. Part 3: Linear mixed models with simple, scalar random effects.  Madison (WI): University of Wisconsin-Madison. 

  2. Dewangan, N.K., Dahiya, G.S., Janghel, D.K. and Dohare, S. (2022). Diversity analysis for seed yield and its component traits among faba bean (Vicia faba L.) germplasm lines. Legume Research. 45(6): 689-694. doi: 10.18805/LR-4301.

  3. FAOSTAT. (2024). Global production and area harvested for faba beans (dry and green). Food and Agriculture Organization of the United Nations. 

  4. Gasim, S. and Link, W. (2007). Agronomic performance and the effect of soil fertilization on German winter faba bean. Journal of Central European Agriculture. 8(1): 121-127.

  5. Hamza, F.E.A. and Hussein, N.B. (2024). Assessment of genetic variability and heritability of elite faba bean (Vicia faba L.) landraces based on yield and yield-related characters in Northern Sudan. Net Journal of Agricultural Science. 12(2): 17-22. 

  6. Hawtin, G.C. and Hebblethwaite, P.D. (1983). Background and History of Faba Bean Production. In: The Faba bean (Vicia faba L.), [Hebblethwaite, P.D. (Ed.)], Butterworths. (pp. 3-22).

  7. Hiywotu, A.M., Abate, A., Worede, F. and Marefia, A. (2022). Genetic variability in Ethiopian faba bean (Vicia faba L.) accessions. Cogent Food and Agriculture. 8(1): 2132847. 

  8. Hiywotu, A.M., Abate, A. andWorede, F. (2023). Correlation and path coefficient analysis of yield and yield components of some Ethiopian faba bean (Vicia faba L.) accessions. Acta Agriculturae Slovenica. 119(1): 1-11.

  9. IMARC Group. (2025). Fava beans market: Global industry trends, share, size, forecast 2025-2033. IMARC Group. 

  10. Mahalanobis, P.C. (1936). On the generalized distance in statistics. Proceedings of the National Institute of Sciences of India. 2(1): 49-55.

  11. Martineau-Côté, D., Achouri, A., Karboune, S. and Hocine, L.L. (2022). Faba bean: An untapped source of quality plant proteins and bioactives. Nutrients. 14(8): 1541. 

  12. Mishra, U.K., Tripathi, R.M., Yadav, D., Kumar, R., Prajapati, P.K. and Muwel, K. (2023). Studies on correlation and path coefficient analysis in faba bean (Vicia faba L.). International Journal of Plant and Soil Science. 35(21): 362-368. 

  13. Olivoto, T. and Lúcio, A.D.C. (2021). MGIDI: Towards an effective multivariate selection in biological experiments. Bioinformatics. 37(10): 1383-1389. 

  14. Olivoto, T., de Souza, V.Q., Nardino, M., Karuppasamy, S. et al. (2022). MGIDI: A powerful tool to analyze plant multivariate data. Plant Methods. 18(121). https://doi.org/10.1186/ s13007-022-00952-5.

  15. R Core Team. (2020). R: A language and environment for statistical computing (Version 4.0) [Computer software]. R Foundation for Statistical Computing.

  16. Salem, S.A. (2009). Heterosis and combining ability in a diallel cross of eight faba bean (Vicia faba L.) genotypes. Asian Journal of Crop Science. 1: 66-76.

  17. Singh, A.K. and Bhatt, B.P. (2012a). Faba bean (Vicia faba L.): A potential leguminous crop of India. Rajendra Centre for ER.  pp 517.

  18. Singh, A.K. and Bhatt, B.P. (2012b). Faba bean: Unique germplasm explored and identified. Hort Flora Research Spectrum. 1(3): 267-269.

  19. Singh, A.K., Bhatt, B.P., Sundaram, P.K., Gupta, A.K. and Singh, D. (2013). Planting geometry to optimize growth and productivity of faba bean (Vicia faba L.) and soil fertility. Journal of Environmental Biology. 34(1): 117-122.

  20. Soliman, A.A., Ibrahim, M.A., Mousa, M.I. et al. (2024). Genetic potential and inheritance pattern of agronomic traits in faba bean under free and infested orobanche soil conditions. BMC Plant Biology. 24(301). https://doi.org/ 10.1186/s12870-024-05017-4.

  21. Toker, C. (2004). Estimates of broad-sense heritability for seed yield and yield criteria in faba bean (Vicia faba L.). Hereditas. 140(3): 222-225. 

  22. Upadhyay, D. and Pandey, K.K. (2025). Studies on genetic divergence in faba bean (Vicia faba L.). Indian Journal of Agricultural Research. 59(7): 1046-1050. doi: 10.18805/IJARe.A-6268.

  23. USDA. (2021). Food Data Central (Nutrient Database). U.S. Department of Agriculture. 

  24. Vered, Y., Grosskopf, I., Palevitch, D., Harsat, A., Charach, G., Weintraub, M.S. and Graff, E. (1997). The influence of Vicia faba (broad bean) seedlings on urinary sodium excretion. Planta Medica. 63(3): 237-240.

  25. Wickham, H. (2011). ggplot2. Wiley Interdisciplinary Reviews: Computational Statistics. 3(2): 180-185. 

  26. Zayed, E.M., Ghareeb, Z.E. and Saad, K.I. (2022). Genetic diversity and principal component analysis (PCA) of faba bean landraces based on yield-traits and protein SDS-PAGE. Journal of Global Agriculture and Ecology. 13(4): 1-16.

Identification of Superior Faba bean (Vicia faba L.) Accessions for Key Morphological and Yield-related Traits

Y
Yash Kumar Singh1,*
P
Parshuram Sial2
M
Manoj Kumar3
T
Thamaraikannan Sivakumar4
Y
Yogesh Kumar1
A
Ananya Singh5
V
V.P. Sahi6
1Department of Horticulture, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
2Regional Research and Technology Transfer Station, Odisha university of Agriculture and Technology, Semiliguda, Koraput-763 002, Odisha, India.
3Department of Plant Breeding and Genetics, Bihar Agricultural University Sabour-813 210, Bihar, India.
4Division of Genomic Resources, National Bureau of Plant Genetic Resources, New Delhi-110 012, India.
5Department of Plant Pathology, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
6Department of Genetics and Plant Breeding, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj-211 007, Uttar Pradesh, India.
  • Submitted18-09-2025|

  • Accepted25-12-2025|

  • First Online 30-12-2025|

  • doi 10.18805/LR-5573

Background: Faba bean (Vicia faba L.) is a multipurpose legume whose genetic improvement relies on exploiting existing morphological variation. Rigorous phenotypic evaluation provides insights into heritability, trait associations and divergence patterns, which are crucial for identifying superior genotypes and its selections in breeding programs.

Methods: Forty-five genotypes, including three checks, were evaluated across two Rabi seasons in a randomized block design for 20 agro-morphological traits spanning phenology, plant architecture, pod/seed attributes and yield. Best Linear Unbiased Estimates (BLUEs) were computed across replications and seasons.Broad-sense heritability. Genotypic variation, genetic divergence, D2 statistics, trait associations and the Multi-trait Genotype-Ideotype Distance Index (MGIDI) were applied to identify superior accessions.

Result: Analysis of variance showed significant genotypic variation for most of the traits, with high heritability. However, leaflet width and seeds per pod had moderate heritability. Mahalanobis D2 divided genotypes into four clusters. Cluster IV displayed strong yield traits and significant differences from Cluster I, indicating potential for hybrid vigour. Correlation analysis revealed positive relationships, such as maturity with pods per plant and pod length, along with some trade-offs. MGIDI found nine promising accessions that combine earliness, good pod and seed traits and appealing structure. Overall, these results confirm a wide range of phenotypic diversity and provide a clear set of accession for improving faba bean.

Faba bean (Vicia faba L.) is an annual legume belonging to the family Fabaceae. It is widely known by several names such as broad bean, horse bean, windsor bean, tick bean and bakela (Hawtin and Hebblethwaite, 1983). Global production of faba bean continues to rise, with production of approximately 6.77 million tonnes of dry beans and 1.81 million tonnes of green beans in 2022, cultivated over ~6.63 million and 0.64 million acres, respectively (FAOSTAT, 2024). Recent market forecasts further emphasize the growing demand. IMARC Group (2025) reported that global fava bean production reached 9.3 million tonnes in 2024 and it is projected to expand to 12.9 million tonnes by 2033, growing at a compound annual growth rate of 3.5%.
       
Although it is grown in limited areas in India and considered a minor crop, it holds promise due to its nutritional value, adaptability and potential role in crop diversification (Singh et al., 2012a; Singh et al., 2013). Faba bean is highly valued as a source of plant-based protein, with seeds containing 22-35% protein, 58% carbohydrates and 25% dietary fibre (USDA, 2021; Martineau-Côté et al., 2022). In addition to its nutritional role, the crop provides several bioactive compounds, including flavonoids, phenolics and L-DOPA (a precursor of dopamine), which have potential medicinal applications in the treatment of Parkinson’s disease (Vered et al., 1997). The crop also serves as an important source of food and feed in developing countries, while in developed nations it is mainly utilized as animal feed (Gasim and Link, 2007; Singh and Bhatt, 2012a).
       
Globally, more than 30,000 germplasm of faba bean are conserved in gene banks (Singh and Bhatt, 2012b). This genetic variability presents significant opportunities for crop improvement, but effective utilization requires a systematic characterization and evaluation. Morphological characterization remains the first and most practical step in germplasm evaluation, providing information on the extent of variation for traits of agronomic and breeding importance (Salem, 2009; Hiywotu et al., 2022). Grouping accessions and identifying important genotypes for trait-specific breeding are made easier by multivariate techniques like principal component analysis and cluster analysis Zayed et al. (2022) for instance, used PCA to identify the characteristics that contributed to significant phenotypic divergence in faba bean accession and to direct parent selection.
Plant material and field conditions
 
A total of 42 diverse faba bean accessions, along with three checks (HFB 1, HFB 2, Vikrant), were obtained from ICAR-NBPGR, New Delhi and evaluated for morphological traits. The trial was conducted in an RBD at the Horticultural Research Farm, SHUATS, Prayagraj, during rabi 2022-23 and 2023-24. Each accession was grown in six-row plots (1.5 × 2.0 m²), with five tagged plants per replication for data recording.
 
Data collection
 
In the present study, a set of 20 phenotypic traits was evaluated for faba bean to understand their breeding importance. All the traits have been characterized according to the DUS standard. These traits included: days of germination (DG), days to first flowering (DFF), days to 50% flowering (DF), number of branches per plant (NBP), number of nodes on the main branch (NNMB), number of flowers per plant (NFP, number), plant height (PH), leaflet length (LL), leaflet width (LW), number of pods per plant (NPP), pod length (PL), pod width (PW), number of seeds per pod (NSP), biological yield (BY), economical yield (EY), harvest index (HI), days to first pod formation (DPF), seed index (SI), days to maturity (DM) and yield per hectare (YPH).
 
Statistical analysis
 
For each trait recorded, data were analyzed for the Analysis of Variance (ANOVA) along with the significance levels of *p<0.05 (significant), **p<0.01 (highly significant) and ***p<0.001 (extremely significant) and Broad sense heritability using the agricolae package of R. Each replication in the two seasons comprised data collected from five plants per replication and the Best Linear Unbiased Estimates (BLUEs) values were computed using the lme4 package in R (Bates, 2010) for each genotype across the three replications. This approach minimized the environmental variation and provided unbiased estimates of each accession.
       
Based on the BLUE values estimated, summary statistics have been calculated using the agricolae package in R to verify the central tendency and trait performance across the accessions. Furthermore, the distribution and trends of each trait among the accessions were visualized using histograms overlaid with frequency curves, generated with the ggplot2 package in R (Wickham, 2011).
 
D2 clustering analysis
 
A D2 analysis was performed to group the accessions into various clusters based on their phenotypic BLUE values for all traits, enabling us to identify accessions with identical or distinct phenotypic characteristics. This clustering enables us to understand the nature of variation within and between accessions, which is essential in the breeding program for selecting parents. Phenotypic divergence of accessions was measured using a generalized distance (D2) proposed by Mahalanobis (1936), a multivariate distance that considers correlations among traits and accurately differentiates among accessions. Furthermore, intra and inter-cluster distances were determined to assess the relationships of clusters and the extent of dissimilarity among them to identify highly dissimilar accessions to hybridize and succeed in improving faba bean traits.
 
Correlation analysis
 
Correlation analysis determines the level of association between the traits under study, thereby establishing key conclusions. The 20 morphological traits were compared in pairs to produce the estimated pairwise correlations using the corr_plot function of the metan package in R (Olivoto and Lúcio, 2021). The analysis was performed to determine the strength and relationship of association between traits, to identify interdependent traits that may serve as indirect selection criteria during the breeding process. The significance of the correlations was also tested, with correlations referred to as significant (*p<0.05), highly significant (**p<0.01) and extremely significant (***p<0.001).
 
Selection of the best accessions
 
The Multi-Trait Genotype-Ideotype Distance Index (MGIDI) was calculated according to the procedure outlined by (Olivoto et al., 2022). The pre-determined desired ideotype trait performance used the BLUE values of each accession. Factor analysis was conducted to reduce multicollinearity and generalize trait variation and the genotype scores were projected in the reduced factorial space. The MGIDI of each accession was computed as the eclidean distance between the genotype and the ideotype, with lower values denoting a closer resemblance to the ideotype. All calculations were performed in R version 4.1.2 (R Core Team, 2020), utilizing the metan package (Olivoto and Lúcio, 2021) and visualized with ggplot2 (Wickham, 2011).
Analysis of variance and heritability
 
Analysis of variance revealed significant differences among accessions for most of the traits (DG, DFF, D50F, NBP, NNMB, NFP, PH, LL, NPP, PL, PW, BY, EY, HI, DPF, SI, DM, YPH). This confirms substantial genetic variation. LW and NSP showed moderate variation (P<0.05). Replication effects were non-significant (Table 1), indicating that differences were mostly due to genotypes.

Table 1: Anova table.


       
Reproductive traits (DFF, D50F, NFP, NPP) and morphological traits (PH, LL, PL, PW, HI) showed strong effects from genotypes, making them useful for selection. Broad-sense heritability was high (>60%) for most of the traits. LW had a moderate value of 38.1%, while NSP was also moderate; PW had the highest value at 98.2%. No traits fell in the low heritability range.
       
The frequency distribution (Fig 1) showed that most accessions yielded 32-36 g/plant, with only four exceeding 37 g/plant. Seventeen accessions flowered at 60-62 days, with four flowering earlier. Nine accessions produced the highest number of flowers (25-28) and seven matured early (~110 days), while most matured around 120 days.

Fig 1: Frequency histogram for the plant morphological traits for the faba bean accessions.


 
D2 clustering and cluster trait variation
 
D² cluster analysis grouped the 45 accessions into four clusters based on 20 traits (Fig 2). Cluster I included 13 accessions, such as IVT HB18-10, IVT HB19-10 and the check Vikrant. Cluster II had 11 accessions, including IVT HB19-12, AVT I HB16-15 and the check HFB-2 (Table 2). Both Cluster I and Cluster II showed lower average values for yield traits. Cluster III had 9 accessions, while Cluster IV contained 11 accessions. Clusters III and IV recorded higher averages, with Cluster IV surpassing the others in biomass yield, grain yield, flowering, maturity and harvest index. This highlights its potential for high yield (Fig 3).

Fig 2: D2 clustering of the faba bean accessions based on phenotypic traits.



Fig 3: Cluster wise variation of the traits in each cluster.



Table 2: List of genotypes under D2 clusters.


       
The variation within clusters was highest in Cluster IV (7.68) and Cluster II (7.66) and lowest in Cluster III (4.23). The largest differences between clusters happened between Cluster I and Cluster IV (7.68) and between Cluster I and Cluster III (6.91). The smallest difference was between Cluster II and Cluster I (4.3) (Fig 4). These results show clear differences in traits, with Cluster III being the smallest and Cluster IV exhibiting the most tremendous internal variation.

Fig 4: Intra and iter cluster dstance matrix.


 
Correlation
 
Correlation analysis of morphological characteristics in faba bean demonstrated that morphological characteristics were differently associated with flowering, pod and yield characteristics, as illustrated in (Fig 5). The days to first flowering had a strong positive and significant correlation with days to maturity (r = 0.45), but a negative and non-significant correlation with the pod weight (r = -0.45). Days to fifty percent flowering is positively correlated and highly significant with the biological yield (r= 0.99) and the economic yield (r = 0.66). There were positive and significant relationships between days to maturity and the number of pods per plant (r = 0.87), pod length (r = 0.91) and number of Seeds per pod (r = 0.59).

Fig 5: Correlation scatterplot for the faba bean morphological traits.


       
Pod length is negatively correlated with pod width (r = -0.02) and highly significantly correlated positively with the number of seeds per pod (r = 0.63) and seed index (r = 0.86). Pod width is negatively correlated with the number of pods per plant (r=-0.04). The number of seeds per pod is highly significant and positively correlated with the seed index (r = 0.86), as well as showing a positive correlation with both biological yield (r = 0.29) and economic yield (r = 0.15). Biological yield is positively correlated with yield per hectare (r = 0.07). Biological yield is positively correlated with the number of branches per plant (r=0.24), number of flowers per plant (r=0.15), leaf length (r = 0.18), leaf width (r=0.14) and number of pods per plant (r=0.16).
 
Selection of superior faba bean accessions
 
MGIDI analysis identified nine top accessions (GP RFBGP-42, GP RFBGP-79, HFB-1, RFB-48, GP FLRP-20, GP FLRP-31, GP RFBGP-44, GP FLRP-29, GP FLRP-23) that closely matched the ideal profile. These accessions combine early maturity, favorable pod and seed traits and strong plant structure (Fig 6). They are promising choices for yield stability and genetic improvement in breeding programs.

Fig 6: MGIDI analysis for the selection of superior faba bean accessions.


       
Principal component analysis (PCA) found that the first five principal components (PCs) explained 79.2% of the total variation, with PC1 contributing 44.3% (Table 3). Factor analysis (mean communality = 0.79; grouped traits into five factors: FA1 (DG, PL, NBP, NFP, NPP, SI) related to yield potential; FA2 (D50F, BY); FA3 (HI, EY); FA4 (NNMB, PW); and FA5 (LW, PH) (Table 4 and Fig 7).

Table 3: Principal component analysis (PCA) for MGIDI.



Table 4: Factor analysis (FA)-Factor loadings after rotation.



Fig 7: Contribution of traits to MGIDI across five factors in superior genotypes selection.


       
Selection differential analysis showed positive genetic gains in NBP, NFP, NPP, PL, SI, YPH, HI, PH and LW, which support breeding goals. In contrast, traits like DFF, DPF, DM, D50F and BY displayed negative differentials. The MGIDI rankings (Table 5) incorporated these patterns to prioritize accessions with the best trait combinations.

Table 5: Selection differential (SD) for MGIDI analysis.


 
Statistical significance
 
Faba bean accessions exhibited highly significant genotypic variations for most of the traits, highlighting the presence of substantial genetic variability within the studied panel-a fundamental requirement for selection and crop improvement. Similar findings were previously reported by Toker (2004), who consistently observed high heritability values for yield-related traits in diverse faba bean germplasm. The predominance of high heritability estimates (>60%) in this study suggests that many traits are primarily governed by additive genetic factors, making them suitable for direct phenotypic selection. Comparable results were also demonstrated by Hamza and Hussein (2024), who identified high heritability for plant height, pods per plant and seed weight in Sudanese faba bean landraces.
       
By contrast, traits with moderate heritability, such as leaflet width and number of seeds per pod, appear to be more environmentally influenced, necessitating evaluation across multiple environments and seasons. This aligns with the findings of Hiywotu et al., (2022), who reported that genotype × environment interactions play a critical role in shaping trait heritability and selection efficiency in Ethiopian faba bean accessions.

Clustering signifies the phenotypic trait variation
 
The clustering by Mahalanobis D2 revealed distinct groups of accessions, reflecting strong phenotypic influence that can be strategically exploited in breeding. Accessions from clusters separated by large inter-cluster distances are particularly valuable for crossing because they maximize the potential for heterosis and recovery of transgressive segregants. At the same time, clusters with higher intra-cluster variability provide opportunities for effective within-cluster selection. Comparable findings were reported by Dewangan et al., (2022), who observed clear cluster differentiation among faba bean lines, with distant clusters offering the best parental combinations for yield improvement. Similarly, Upadhyay and Pandey (2025) demonstrated that clusters with superior mean performance for seed yield and related traits are prime candidates for hybridization, thereby broadening the genetic base and enhancing productivity.

Correlation across the morphological traits
 
The patterns of correlation between flowering, maturity, pod characteristics and yield highlight the intricate relationship between faba bean productivity. In line with Mishra et al., (2023), who found that harvest index and biological yield had a significant direct impact on seed yield, positive correlations between reproductive timing and yield components imply that enhancing one can improve others. However, the existence of both positive and negative correlations highlights the importance of careful selection in balancing trade-offs. Hiywotu et al., (2023) reported similar results, showing that traits related to pods and seeds had a significant impact on grain yield through direct and indirect effects, indicating their usefulness as selection criteria.
 
Superior faba bean accessions selection
 
Using the MGIDI framework enabled simultaneous, balanced selection across phenology, yield and architecture while controlling for multicollinearity via factor analysis; by ranking genotypes on their distance to an agronomic ideotype, MGIDI prioritizes accessions that deliver multi-trait gains without over-weighting redundant, correlated traits (Olivoto and Lúcio, 2021; Olivoto et al., 2022). The analysis captured by the leading principal components together with high trait communalities supports the reliability of the multivariate signal underlying the MGIDI ranking; comparable patterns-where the first PCs explain most agronomic variation and clearly separate superior, often earlier and higher-yielding faba bean genotypes-are reported in recent faba bean multivariate studies (Soliman et al., 2024). In this context, positive selection differentials for pod-and seed-related traits alongside negative differentials for flowering and maturity are consistent with ideotype-based improvement that jointly targets sink capacity and earliness, as advocated in faba bean PCA-guided selection and in cross-crop MGIDI applications (Olivoto and Lúcio, 2021; Olivoto et al., 2022).
This study revealed substantial genetic variability among 45 faba bean genotypes for key morphological and yield-related traits, with high heritability supporting the effectiveness of direct phenotypic selection. Cluster and divergence analyses identified Cluster IV as a valuable source of superior genotypes. They highlighted inter-cluster combinations, particularly between Clusters I and IV, as promising for heterosis and broadening the genetic base. Correlation analysis confirmed pods per plant, pod length and seed index as reliable indirect selection traits, while weaker associations emphasized the importance of managing trade-offs in breeding. Multivariate approaches, including PCA and MGIDI, effectively integrated trait information and prioritized ideotype-proximal genotypes such as GP RFBGP-42, GP RFBGP-79 and HFB-1 RFB-48 for advancement. Collectively, these findings provide a clear framework for selecting parents and advancing promising lines.
The authors express their gratitude to, Department of Horticulture, SHUATS, Prayagraj, India, for the experimental field and academic support.
The authors declare that there are no conflicts of interest regarding the publication of this article. No funding or sponsorship influenced the design of the study, data collection, analysis, decision to publish, or preparation of the manuscript.

  1. Bates, D. (2010). Mixed models in R using the ime4 package. Part 3: Linear mixed models with simple, scalar random effects.  Madison (WI): University of Wisconsin-Madison. 

  2. Dewangan, N.K., Dahiya, G.S., Janghel, D.K. and Dohare, S. (2022). Diversity analysis for seed yield and its component traits among faba bean (Vicia faba L.) germplasm lines. Legume Research. 45(6): 689-694. doi: 10.18805/LR-4301.

  3. FAOSTAT. (2024). Global production and area harvested for faba beans (dry and green). Food and Agriculture Organization of the United Nations. 

  4. Gasim, S. and Link, W. (2007). Agronomic performance and the effect of soil fertilization on German winter faba bean. Journal of Central European Agriculture. 8(1): 121-127.

  5. Hamza, F.E.A. and Hussein, N.B. (2024). Assessment of genetic variability and heritability of elite faba bean (Vicia faba L.) landraces based on yield and yield-related characters in Northern Sudan. Net Journal of Agricultural Science. 12(2): 17-22. 

  6. Hawtin, G.C. and Hebblethwaite, P.D. (1983). Background and History of Faba Bean Production. In: The Faba bean (Vicia faba L.), [Hebblethwaite, P.D. (Ed.)], Butterworths. (pp. 3-22).

  7. Hiywotu, A.M., Abate, A., Worede, F. and Marefia, A. (2022). Genetic variability in Ethiopian faba bean (Vicia faba L.) accessions. Cogent Food and Agriculture. 8(1): 2132847. 

  8. Hiywotu, A.M., Abate, A. andWorede, F. (2023). Correlation and path coefficient analysis of yield and yield components of some Ethiopian faba bean (Vicia faba L.) accessions. Acta Agriculturae Slovenica. 119(1): 1-11.

  9. IMARC Group. (2025). Fava beans market: Global industry trends, share, size, forecast 2025-2033. IMARC Group. 

  10. Mahalanobis, P.C. (1936). On the generalized distance in statistics. Proceedings of the National Institute of Sciences of India. 2(1): 49-55.

  11. Martineau-Côté, D., Achouri, A., Karboune, S. and Hocine, L.L. (2022). Faba bean: An untapped source of quality plant proteins and bioactives. Nutrients. 14(8): 1541. 

  12. Mishra, U.K., Tripathi, R.M., Yadav, D., Kumar, R., Prajapati, P.K. and Muwel, K. (2023). Studies on correlation and path coefficient analysis in faba bean (Vicia faba L.). International Journal of Plant and Soil Science. 35(21): 362-368. 

  13. Olivoto, T. and Lúcio, A.D.C. (2021). MGIDI: Towards an effective multivariate selection in biological experiments. Bioinformatics. 37(10): 1383-1389. 

  14. Olivoto, T., de Souza, V.Q., Nardino, M., Karuppasamy, S. et al. (2022). MGIDI: A powerful tool to analyze plant multivariate data. Plant Methods. 18(121). https://doi.org/10.1186/ s13007-022-00952-5.

  15. R Core Team. (2020). R: A language and environment for statistical computing (Version 4.0) [Computer software]. R Foundation for Statistical Computing.

  16. Salem, S.A. (2009). Heterosis and combining ability in a diallel cross of eight faba bean (Vicia faba L.) genotypes. Asian Journal of Crop Science. 1: 66-76.

  17. Singh, A.K. and Bhatt, B.P. (2012a). Faba bean (Vicia faba L.): A potential leguminous crop of India. Rajendra Centre for ER.  pp 517.

  18. Singh, A.K. and Bhatt, B.P. (2012b). Faba bean: Unique germplasm explored and identified. Hort Flora Research Spectrum. 1(3): 267-269.

  19. Singh, A.K., Bhatt, B.P., Sundaram, P.K., Gupta, A.K. and Singh, D. (2013). Planting geometry to optimize growth and productivity of faba bean (Vicia faba L.) and soil fertility. Journal of Environmental Biology. 34(1): 117-122.

  20. Soliman, A.A., Ibrahim, M.A., Mousa, M.I. et al. (2024). Genetic potential and inheritance pattern of agronomic traits in faba bean under free and infested orobanche soil conditions. BMC Plant Biology. 24(301). https://doi.org/ 10.1186/s12870-024-05017-4.

  21. Toker, C. (2004). Estimates of broad-sense heritability for seed yield and yield criteria in faba bean (Vicia faba L.). Hereditas. 140(3): 222-225. 

  22. Upadhyay, D. and Pandey, K.K. (2025). Studies on genetic divergence in faba bean (Vicia faba L.). Indian Journal of Agricultural Research. 59(7): 1046-1050. doi: 10.18805/IJARe.A-6268.

  23. USDA. (2021). Food Data Central (Nutrient Database). U.S. Department of Agriculture. 

  24. Vered, Y., Grosskopf, I., Palevitch, D., Harsat, A., Charach, G., Weintraub, M.S. and Graff, E. (1997). The influence of Vicia faba (broad bean) seedlings on urinary sodium excretion. Planta Medica. 63(3): 237-240.

  25. Wickham, H. (2011). ggplot2. Wiley Interdisciplinary Reviews: Computational Statistics. 3(2): 180-185. 

  26. Zayed, E.M., Ghareeb, Z.E. and Saad, K.I. (2022). Genetic diversity and principal component analysis (PCA) of faba bean landraces based on yield-traits and protein SDS-PAGE. Journal of Global Agriculture and Ecology. 13(4): 1-16.
In this Article
Published In
Legume Research

Editorial Board

View all (0)