Loading...
Thumbnail Image

Theses

Browse

Search Results

Now showing 1 - 9 of 41
  • ThesisItemOpen Access
    Fitting linear mixed effects models for unbalanced longitudinal data
    (CCSHAU, Hisar, 2020-08) Ravita; Verma, Urmil
    The classical linear regression model is an important statistical tool but its use is limited because of its standard assumptions. Regression models using time series data occur quite oftenly and the assumption of uncorrelated or independent errors is often not appropriate. Moreover, many time series having complex structure calls for the addition of fixed and random effects accounting for the observational design. Such effects are straightforward to add in a mixed model environment (accommodating unbalanced data). The fixed-effects parameters can be either qualitative (as in the traditional analysis of variance) or quantitative (as in standard linear regression). BLUP is a standard method for estimating random effects of a mixed model. The mixed procedure uses the REML method, also known as residual maximum likelihood. It is here that the Gaussian assumptions are exploited. One such class is varying coefficient models, where the response variable is allowed to depend linearly on some regressors, with coefficients as smooth functions of some other predictor variables, called the effect modifiers. Varying coefficient models, where the effect modifier variable is calendar time, leads to time-varying coefficient models. The statistical modelling approaches viz., multiple linear regression and linear mixed effects were applied to develop mustard yield forecasts models on agro-climatic zone basis in Haryana. The mustard yield data for the period 1980-81 to 2016-17 of Hisar, Bhiwani, Sirsa, Mahendragarh and Gurugram, 1989-90 to 2016-17 of Rewari and 1997-98 to 2016-17 of Jhajjar and Fatehabad districts alongwith fortnightly weather data were used for the purpose. The zonal yield forecast models have been developed on the basis of time-trend and weather data from 1980-81 to 2009-10 while the data from 2010-11 to 2016-17 were used for validity checking of the developed models. Trend yield/time variable was included to take care of variation between districts within zone as the weather data are not available for all the districts, though the zonal model utilized the same weather information in adjoining districts under the zone. The linear mixed effects models with time both as fixed and random effects and weather as random effects with covariance structures; VC, AR(1) and Toeplitz have been fitted. The post-sample predictive performance(s) of alternative LMMs and regression based weather-yield models were observed in terms of percent relative deviations from real-time yield(s) and root mean square error(s), and that differed markedly among the alternative models. LMMs with weather as random effect(s) consistently showed the superiority over regression based weather-yield models in capturing lower percent relative deviations. The LMMs with weather as random effects performed well with lower error metrics as compared to the alternative mixed effects/regression models in most of the post-sample time regimes. Sevensteps ahead (i.e. 2010-11 to 2016-17) predicted values favour the use of LMMs. A critical in-depth of the results indicates the preference of using varying coefficients models in comparison to conventional, i.e., constant/fixed coefficients models developed under this empirical study. The linear mixed effects models with Toeplitz type structure substantially improved the predictive accuracy and produced what can be considered as satisfactory district-level mustard yield prediction in Haryana.
  • ThesisItemOpen Access
    Time series intervention modeling and simulation for mustard yield forecasting in Haryana
    (CCSHAU,HiSAR, 2020-10) Ajay Kumar; Verma, Urmil
    Modeling and Simulation is a discipline for developing a level of understanding of the interaction of the parts of a system, and of the system as a whole. A model is a simplified representation of a system at some particular point in time or space intended to promote understanding of the real system. Simulation permits the evaluation of operating performance prior to the implementation of a system. The study compares the efficacy of time series Intervention models and simulation in quantifying the pre-harvest mustard yield in Hisar, Bhiwani, Sirsa, Fatehabad, Mahendragarh, Rewari, Jhajjar and Gurugram districts of Haryana. The objective of this study was to assess the forecast accuracy of the contending models for district-level mustard yield forecasts in Haryana. The fortnightly weather data on rainfall, minimum temperature and maximum temperature over the crop growth period (September-October to February-March) have been utilized from 1980-81 to 2010-11 for the models‟ building. The weather-yield data from 2011-12 to 2015-16 have been used to check the post-sample validity of the fitted models for mustard yield forecasts in comparison to those obtained from State Department of Agriculture crop yield(s) estimates. The statistical modeling approaches viz., multiple linear regression, ARIMA, regression with ARIMA errors (RegARIMA) and ARIMA-Intervention were applied for the purpose. First of all, weather-yield models based on multiple linear regression were developed to relate mustard yield to fortnightly weather input alongwith linear time-trend yield/crop condition term as an indicator variable.Alternatively, ARIMA, RegARIMA, and ARIMA-Intervention models were fitted as per targeted objectives. Additionally, Student‟s t-copula in SAS is applied as a simulation tool and compared the output to the time series forecasts. The forecasts are compared to determine if there is either a consistent or significant difference between the two output. The forecast performance(s) of the alternative models were observed in terms of percent relative deviations of mustard yield forecasts from observed yield(s) and root mean square error(s). RegARIMA models performed well with lower error metrics as compared to the alternative models in most of the time regimes. Five-steps ahead forecast figures i.e. 2011-12 to 2015-16 favour the use of RegARIMA models to obtain pre-harvest mustard yield forecasts in the districts under study. The forecasts generated by RegARIMA are remarkably close to the forecasts obtained through the simulation process. Empirical evidence from this study confirms that the RegARIMA model can produce reliable forecasts and would therefore provide a more robust approach of forecasting with limited data sets.using the developed forecast models, the district-level mustard yield estimates could be computed successfully well in advance of the actual harvest. On the other hand, the State Department of Agriculture crop yield estimates are obtained quite late after the actual crop harvest.
  • ThesisItemOpen Access
    State Space Modelling with Weather as Exogenous Input for Sugarcane Yield Prediction in Haryana
    (CCSHAU, Hisar, 2020-05) Hooda, Ekta; Verma, Urmil
    Parameter constancy is a fundamental issue for empirical models to be useful for forecasting, analyzing or testing any theory. This work addresses the concept of parameter constancy and the implications of predictive failure. Predictive failure is uniquely a post-sample problem. Unlike classical regression analysis, the state space models are time varying parameters models as they allow for known changes in the structure of the system over time and provide a flexible class of dynamic and structural time series models. The study has been performed in two parts i.e. the development of state space models in two forms (the state space and unobserved component approach), and the state space models with weather as an exogenous input for sugarcane yield prediction in Ambala, Karnal, Kurukshetra, Panipat and Yamunanagar districts of Haryana. The time series sugarcane yield data for the period 1966-67 to 2009-10 of Ambala and Karnal districts, 1972-73 to 2009-10 of Kurukshetra and 1970-71 to 2009-10 of Panipat and Yamunanagar districts were used for the development of different models. The validity of fitted models have been checked for the subsequent years i.e., 2010-11 to 2016-17, not included in the development of the yield forecast models. The selection of autoregressive orders, i.e., five, three, two, four and five looked reasonable for Ambala, Karnal, Kurukshetra, Panipat and Yamunanagar districts respectively helped in determining the amount of past information to be used in the canonical correlation analysis and further leading to the selection of state vector. Information from the canonical correlation and preliminary autoregression analyses were used to form preliminary estimate of the parameters of state space models and that provided the sugarcane yield estimates using Kalman filtering technique. The UCMs with level, trend and irregular components were fitted to study the trend of sugarcane yield. For all the five districts, the irregular component was found to be highly significant while both level and trend component variances were observed non-significant. Lastly, the state space models with weather as exogenous input using different types of growth trends viz., polynomial splines; PS(1), PS(2) and PS(3) were developed. The weather variables used for each district were selected on the basis of stepwise regression method and PS(2) with weather input was selected as the best suited model for all districts. The post-sample sugarcane yield estimates were obtained on the basis of fitted SS, UCM and SSM with exogenous input. The predictive performance(s) of the contending models were observed in terms of percent relative deviations and RMSEs of sugarcane yield forecasts in relation to observed yield(s). The SSMs with weather input consistently showed the superiority over SS and UCM models in capturing lower percent relative deviations. Thus, it is inferred that the state space models may be effectively used pertaining to Indian agriculture data, as it takes into account the time dependency of the underlying parameters which may further enhance the predictive accuracy of time-series models with parameter constancy.
  • ThesisItemOpen Access
    Variable selection for classification and discrimination of Indian Mustard (Brassica juncea) genotypes for yield and oil content
    (CCSHAU, Hisar, 2019-07-10) Godara, Poonam; Hooda, BK
    The present study deals with the problem of variable selection for classification and discrimination of Indian Mustard (Brassica juncea) genotypes for yield and oil content. The study used secondary data on 310 Indian mustard genotypes obtained from Oilseeds section of the department of Genetics and Plant Breeding, CCS HAU, Hisar. The experiment was conducted during rabi season of 2015-16. Five variable selection methods (Univariate Two-Sample t-test, Rao´s F test for Additional Information, STEPDISC Procedure (backward and forward) using Wilk´s Lambda criterion and Random Forests Algorithm) for classification and discrimination were compared using Monte Carlo simulation. Performance of the methods was assessed in terms of leave one out cross validation error for classification. Comparing the performance of various methods affecting seed yield for samples of equal sizes in scheme I, Rao's F test, Wilkˊs lambda (Backward) and Wilkˊs lambda (Forward) were found better than others. In scheme II, the most suitable methods affecting oil content with least leave one out cross validation error rate were Wilkˊs lambda (Backward) and Wilkˊs lambda (Forward). Based on results of the scheme I and II, Wilk´s Lambda (backward and forward) were found most suitable method for classification affecting the seed yield and oil content significantly. In scheme I using leave one out cross validation error rate four important variables for discrimination affecting the seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot with least error of rate of 21.72 per cent. The important variables for discrimination which significantly affected the oil content were siliqua length, Secondary branches, primary branches and days to maturity with least error rate of 33.90 per cent. Secondary branches, siliqua number on main shoot, seeds per siliqua and 1000 seed weight were found to be important variables in scheme III with least error rate of 27.68 per cent. Three characters which discriminate the groups having low seed yield and high seed yield were 1000 seed weight, siliqua length and seeds per siliqua, while siliqua length 1000 seed weight and primary branches were found the most discriminating variables affecting oil content. Using the correlation between variables and discriminant score, the most important variables affecting the seed yield were secondary branches, primary branches and days to maturity. The three most important variables discriminating between oil content were siliqua length, secondary branches and seeds per siliqua. Most important variables discriminating between low seed yield with low oil content and high seed yield with high oil content groups were secondary branches, primary branches and siliqua number of main shoot. The variable, number of secondary branches have been found to be the most important for classification and discrimination of Indian mustard genotypes for seed yield and oil content.
  • ThesisItemOpen Access
    Multidimensional analysis of poverty in Haryana: A fuzzy set approach
    (CCSHAU, 2018) Tanwar, Nitin; Hooda, B.K.
    The present investigation was carried out to measure aspect based multidimensional poverty in Haryana. The necessary data for the study was obtained from the consumer expenditure survey (68th round conducted in 2011-12 and 69th round conducted in 2012) of NSSO on drinking water, sanitation, hygiene and housing conditions. The Multidimensional Poverty Index (MPI) suggested by Alkire & Foster (2011) using the dual cut-off method based on the counting approach has been applied for poverty estimation in rural and urban areas of Haryana. The Totally Fuzzy and Relative Approach due to Costa and Angelis (2008) have been used to measure multidimensional poverty in Haryana. Univariate techniques for poverty measurement such as Head Count Ratio (HCR), Income Gap Ratio (IGR) and Poverty Gap Ratio (PGR) based on monetary data have also been used to estimate the proportion of deprived households at district levels in Haryana. The HCR indicated that the districts of Mewat and Fatehabad have maximum proportion of the poor households in rural Haryana while, the districts of Mewat and Yamuna Nagar have the maximum proportion of poor households in urban Haryana. The districts of Jhajjar, Gurgaon, Sonipat and Karnal have the minimum proportion of the poor households in rural Haryana while the districts of Hisar, Fatehabad and Gurgaon have the minimum proportion of poor households in urban Haryana. The maximum PGR has been observed in the districts of Fatehabad, Yamuna Nagar and Mewat in rural Haryana while the urban households in the districts of Mewat and Yamuna Nagar have the maximum poverty gap ratio. The fuzzy MPI based on the aspects of drinking water facilities, sanitation facilities and housing conditions for Haryana indicated that 33.28% households in overall Haryana are multidimensionally poor with 36.64% households in rural and 30.46% in urban Haryana. The decomposition of the households by social groups indicated that there is not much difference in multidimensional poverty index values among households related to schedule castes (SC), other backward classes (OBC) and others. The index values varied from 30.49 to 34.24 per cent among the social groups. Using the Alkire-Foster aspect based MPI, it was observed that the rural households in the districts of Mewat, Panipat, Mahendragarh, Rohtak, Gurgaon and Palwal have high MPI values indicating high level of poverty or deprivedness in these districts. Similarly the households in urban areas of the districts of Mewat, Panipat, Jhajjar, Rohtak and Mahendragarh were found multidimensionally poor as indicated by high MPI values.
  • ThesisItemOpen Access
    On Construction and Analysis of Bio-assay Designs
    (College of Basic Sciences and Humanities Chaudhary Charan Singh Haryana Agricultural University Hisar, 1984) Rai, Lajpat; Puri, P. D.
  • ThesisItemOpen Access
    Regression Method with Dummy Variables in Linear Models for Unbalanced Data
    (College of Basic Sciences and Humanities Chaudhary Charan Singh Haryana Agricultural University Hisar, 1983) Goswami, Ram Prasad; Singh, Umed
  • ThesisItemOpen Access
    Farmers Perceptin abut the Coverage and the Utility of Agricultural News in Leading Tamil Dailies
    (College of Agriculture Chaudhary Charan Singh Haryana Agricultural University Hisar, 1981) Hermina, J. Queeni; Varma, N. S
  • ThesisItemOpen Access
    Some contributions to model selection strategies in regression analysis
    (Department of Mathematics And Statistics College of Basic Sciences and Humanities Punjab Agricultural University, 2000) Sharma, Saroj Kumari; Cheema, H. S