Advertisements

Mathematics Project Topics

A Study on Fractional Polynomial Regression

A Study on Fractional Polynomial Regression

Advertisements

A Study on Fractional Polynomial Regression

Chapter One

ย Purposeย of theย Study

Most of the existing method on fractional polynomial models focused on fitting modelsย toย psychologicalย andย pharmacokineticย experimentalย data.ย Littleย hasย beenย doneย onย agronomic data although; Nelder (1966) introduced and applied the inverse polynomialย model on fertilizer trials, while Salawu (2007) applied the inverse polynomial model atย quadraticย variable on fertilizer response ofย threeย riceย varieties.

This research focuses on fitting all the power set of a fractional polynomial model on Pmain aim is to observe how well the fractional polynomial model fit the data using normal errors regression analysis when the covariates are continuous or are grouped.

CHAPTER TWO

ย LITERATUREย REVIEW

This chapter seeks to review literature on the works of different scholars with regards toย techniques applied in areas of conventional polynomial regression models, fractionalย polynomial regression model andย issues of assessingย modelย adequacy.

Polynomialย Regressionย Modelsย forย Continuousย Covariates

Aย basicย choiceย inย modelingย isย betweenย parametricย andย non-parametricย models.ย Parametric models such as polynomials are easy to fit and the risk function may beย written down concisely, but they may fit the data badly and give misleading inference.ย On the other hand, non-parametric models may fit the data well but difficult to interpretย due to fluctuations in the fitted curves. The risk function is usually impossible to writeย down concisely (Royston et al., 1999). Polynomial regression entails an inherent trade- o๏ฌ€ between accuracy and e๏ฌƒciency. As the degree of the polynomial increases, the accuracy of the model increases up to a certain point, however the time and space needed increases as well (Stronger and Stone, 2006). Bremer (2012) stated that in practice, we usually start with models of degree one, and if transformations on the predictor or the response are insufficient then we consider models of degree two. Higher degree models should be avoided unless the context from which the data is coming explicitly calls for one of these models. To decide on the appropriate degree of a polynomial regression model, two different strategies are possible. One can start with a linear model and include higher order terms one by one until the highest order term becomes non-significant. This method is generally called Forward Variable selection. On the other hand one could start with a high order model and exclude the non Bsignificantย highestย orderย termsย oneย by oneย untilย theย remaining highestย orderย termย becomesย significant.ย Thisย methodย isย generallyย referredย toย asย Backwardย Variableย selection. In general, the two methods may not lead to the same model. For polynomialย models,ย theseย methodsย areย likelyย overpowered,ย sinceย weย canย restrictย ourย attentionย toย first and second order polynomial models (Bremer, 2012). It is possible to select theย predictorย functions more carefullyย asย curveย linear functions of Xย toย avoid thisย problem.

One problem that is always encountered in regression model building is nonlinearity inย theย relationย betweenย theย outcomeย variableย andย continuousย orย orderedย predictors.ย Traditionally, such predictors are entered into stepwise selection procedures as linearย termsย orย asย dummyย variablesย obtainedย afterย grouping,ย thoughย theย assumptionย ofย linearity may be incorrect (Royston and Sauerberi, 2008). Categorization introducesย problems of defining cutpoint(s) (Altman et al., 1994), overparametrization and loss ofย efficiency; Lagakos (1988). In any case, a cutpoint model is an unrealistic way toย describeย aย smoothย relationshipย betweenย aย predictorย andย aย responseย variable.ย Anย alternativeย approachย isย toย keepย theย variableย continuousย andย allowย someย formย ofย nonlinearity. Hitherto, quadratic or cubic polynomials have been used, but the range ofย curve shapes afforded by conventional low-order polynomials is limited (Royston andย Sauerbrei, 2008). Box and Tidwell (1962) proposed a method of determining a powerย transformation of a predictor. A moreย general family of parametric models, proposedย by Royston and Altman (1994), is based on fractional polynomial (FP) functions andย can be traced back to Box and Tidwellโ€˜s (1962) approach. Royston and Altman (1994)ย presented the FP functions which encompass conventional polynomialsย as a specialย case where one, two or more terms of the form xpย are fitted, the exponentโ€˜s p beingย chosenย from a small, preselected setย of integerย and non-integer values.

Forย non-parametricย regressionย andย scatterย plotย smoothingย areย otherย methodsย inย modeling continuous covariates other than linear and FP functions. For a function of xย with the global-influence property, the fit at a given value x0ย of x may be relativelyย unaffected by local perturbations of the response at x0, but the fit at points distant to x0ย may be affected, perhaps considerably. This property may be regarded by proponents ofย local-influence models as a fatal flaw (Royston and Sauerbrei, 2008). Conventionalย polynomialย regressionย isย aย popularย nonparametricย regressionย techniqueย dueย toย itsย attractive asymptotic properties, in particular at the border of the support. For fullyย observed responses, a local polynomial regression estimate of m(x0) is obtained byย estimatingย aย polynomialย inย xย withย weightedย ordinaryย leastย squares.ย Eachย unitย isย weighted depending on its distance in x to the design point of interest (focal value) x0,ย thereby making the procedure local (Karlsson et al., 2009). According to Royston andย Sauerbrei (2008), a rigorous definition of the global-influence property has not beenย framed,ย butย suchย modelsย areย usuallyย parametricย inย nature.ย Examplesย includeย polynomials,ย nonlinearย modelsย suchย asย exponentialย andย logisticย functions,ย andย fractionalย polynomialsย developedย byย Roystonย andย Altmanย (1994).ย Byย contrast,ย functionsย withย theย local-influenceย property,ย includingย regressionย splinesย (deย Boor,ย 2001), smoothing splines (Green and Silverman, 1994), and kernel-based scatter-plotย smoothers such as locally weighted scatter plot smoothers โ€–LOWESSโ€– (Cleveland andย Devlin, 1988), are typically nonparametric in character. Perturbation of the response atย x0ย usually greatly affects the fit at x0ย but hardly affects it at points distant to x0. One key argument favoring functions with global influence is their potential for use in future applications and datasets (Royston and Sauerbrei, 2008). Without such an aim, functions with local influence might appear the more attractive (Hand and Vinciotti, 2003). According to Royston and Sauerbrei (2008) fractional polynomial functions retain the global-influence property; they are much more flexible than polynomials.ย Further, they stated that low-dimensional fractional polynomial curves may provide aย satisfactoryย fitย whereย high-orderย polynomialsย failย (Roystonย andย Altman,ย 1994).ย Fractional polynomials are intermediate between polynomials and nonlinear curves.ย They may be seen as a good compromise between ultra-flexible but potentially unstableย local-influence models and the relatively inflexible conventional polynomials (Roystonย andย Sauerbrei, 2008).

 

CHAPTER THREEย 

METHODOLOGY

Advertisements

This chapter seeks to explain the methodology used in the research. The study going toย presentย theย fractionalย polynomialย forย normalย errorย regressionย models,ย theย medianย method for categorizing continuous covariates and the deviance method for parameterย estimationย and checkingย adequacyย of theย fitted model.

Normalย Errorย Model

Forย anย individualย withย responseย y, the multiple linear regression model with normal errors

~ย N(0 2ย )ย andย covariateย vectorย Xย =ย (ย x1ย ,ย x2ย ,…,ย xkย )ย withย kย variables,ย mayย be

writtenย asย y

(3.1)

Theย linearย predictorย orย โ€—indexโ€˜, Xย is an important quantity in multivariable modelingย andย equationย (3.1)ย isย calledย theย normalย errorย modelย (Roystonย andย Sauerbrei,ย 2008).

CHAPTERย FOUR

ย ANALYSISย ANDย DISCUSSION OFย RESULTS

ย Introduction

This chapter seeks to present results from the analyses of data and interpretation ofย results for the normal error fractional polynomial regression for the data sets describedย inย chapterย threeย andย presentedย inย theย appendix.ย Theย dataย setย wasย analyzedย asย aย generalizedย linear model (GLM), usingย two different approaches.

CHAPTERย FIVE

SUMMARY,ย CONCLUSIONย ANDย RECOMMENDATION

ย Introduction

This chapter presents the summary, conclusion and recommendations based on theย resultsย obtained in chapter four.

ย Summary

The main objective of this research work is to fit a fractional polynomial regressionย model with continuous covariate and grouped covariate. The essence is to compare theย performance of the fit for continuous covariate and grouped covariates. The generalizedย linear model was fitted for the fractional polynomials. Two different approaches wereย used. The first approach is Royston and Altman method, while the second approach isย ordinaryย fractional polynomials fit.

Fromย theย fittedย fractionalย polynomialย regressionย modelsย itย wasย observedย thatย theย median algorithm method for grouping continuous covariates that was proposed gave aย better results compared to the continuous covariate. For the experimental design data,ย the effect of nitrogen fertilizer, manure and cowpea variety on cowpea yield presentedย in table 4.1 through table 4.10 when MFP regression proposed by Royston and Altmanย was applied the algorithmย for selection factorsย withย significantย effectsย convergedย atย ฯ†(1,,ย 1) with final deviance of 127.97, both fertilizer and manure rates are significant atย 5% level with P-value of 0.029 and 0.001 respectively. On the other hand variety wasย not significant at 5% level. When fertilizer was considered as an independent variableย oneย i.eย (x1) , the algorithm for the selection of factors with significant effects converged at ฯ†(x1,ย 3) with model terms deviance of 127.08.The model for selection of factors withย significant effects converged at ฯ†(x2,ย -2) with model terms deviance of 130.17 for theย manure.

Conclusion

Based on the observations above, the study conclude that the median grouping methodย (polytomous) for grouping continuous covariate did not performed badly, since it gaveย most significant result compare to ungrouped and grouped (dichotomous) continuousย covariates.ย Forย theย Fractionalย polynomialsย regression,ย theย continuousย covariatesย produced the gain (G) of 3.09.When multivariable fractional polynomials regressionย was used, the gain (G) of 6.20 and 25.85 were produced. In fact, most data analystย alwaysย groupedย theirย treatmentย levelsย beforeย analysisย exceptย otherwise.ย Thereforeย grouping could be done adequately depending on the method one obtained his cutpointย orย carryingย out theย groupingย of the continuous covariate.

ย Recommendation

Basedย onย our observationsย fromย chapterย four, theย studyย recommends theย following:

  • Fractional polynomial regression model fit the data well because it is open in thesense that a set of pre-defined powers are available which the best powers amongย otherย contendingย powersย can be
  • Theuseย ofย FPย modeling inย experimentalย designย dataย too,ย sinceย programsย forย interaction effect have been developed in latest versions of STATA, though, wasย notย studied in thisย research work so asย to extend its
  • The median algorithm method of grouping continuous covariate is recommendbecauseย inย thisย researchย workย itย showedย aย contendingย strengthย withย covariateย thatย isย not grouped.

Contributionย toย Knowledge

Theย contributionsย to knowledgeย from thisย research workย areย asย follows;

  1. The comparison between fractional polynomial regression model withcontinuousย covariateย andย grouped covariateย is achieved.
  2. Medianalgorithmย methodย ofย groupingย continuousย covariatesย has
  3. Fittedaย fractionalย polynomialย regressionย modelย inย analyzingย experimentalย designย dataย has been successfully

Suggestionย forย Furtherย Research

Extensionย ofย fractionalย polynomialย regressionย modelย inย fittingย experimentalย designย dataย withย interaction effects isย suggested.

Advertisements

WeCreativez WhatsApp Support
Our customer support team is here to answer your questions. Ask us anything!