Channel: Freakonometrics » ACT2040

↧

GLM, non-linearity and heteroscedasticity

October 22, 2013, 5:00 pm

≫ Next: Pricing Reinsurance Contracts

≪ Previous: Modélisation des coûts individuels

Last week in the non-life insurance course, we’ve seen the theory of the Generalized Linear Models, emphasizing the two important components

the link function (which is actually the key component in predictive modeling)
the distribution, or the variance function

Just to illustrate, consider my favorite dataset

lin.mod = lm(dist~speed,data=cars)

A linear model means here

Image may be NSFW.
Clik here to view.

where the residuals are assumed to be centered, independent, and with identical variance. If we visualize that linear regression, we usually see something like that

Image may be NSFW.
Clik here to view.

The idea here (in GLMs) is to assume

Image may be NSFW.
Clik here to view.

which will produce the same model as the one describe previously, based on some error term. That model can be visualized below,

attach(cars)
n=2
X= cars$speed 
Y=cars$dist
df=data.frame(X,Y)
vX=seq(min(X)-2,max(X)+2,length=n)
vY=seq(min(Y)-15,max(Y)+15,length=n)
mat=persp(vX,vY,matrix(0,n,n),zlim=c(0,.1),theta=-30,ticktype ="detailed", box = FALSE)
reggig=glm(Y~X,data=df,family=gaussian(link="identity"))
x=seq(min(X),max(X),length=501)
C=trans3d(x,predict(reggig,newdata=data.frame(X=x),type="response"),rep(0,length(x)),mat)
lines(C,lwd=2)
sdgig=sqrt(summary(reggig)$dispersion)
x=seq(min(X),max(X),length=501)
y1=qnorm(.95,predict(reggig,newdata=data.frame(X=x),type="response"), sdgig)
C=trans3d(x,y1,rep(0,length(x)),mat)
lines(C,lty=2)
y2=qnorm(.05,predict(reggig,newdata=data.frame(X=x),type="response"), sdgig)
C=trans3d(x,y2,rep(0,length(x)),mat)
lines(C,lty=2)
C=trans3d(c(x,rev(x)),c(y1,rev(y2)),rep(0,2*length(x)),mat)
polygon(C,border=NA,col="yellow")
C=trans3d(X,Y,rep(0,length(X)),mat)
points(C,pch=19,col="red")
n=8
vX=seq(min(X),max(X),length=n)
mgig=predict(reggig,newdata=data.frame(X=vX))
sdgig=sqrt(summary(reggig)$dispersion)
for(j in n:1){
stp=251
x=rep(vX[j],stp)
y=seq(min(min(Y)-15,qnorm(.05,predict(reggig,newdata=data.frame(X=vX[j]),type="response"), sdgig)),max(Y)+15,length=stp)
z0=rep(0,stp)
z=dnorm(y, mgig[j], sdgig)
C=trans3d(c(x,x),c(y,rev(y)),c(z,z0),mat)
polygon(C,border=NA,col="light blue",density=40)
C=trans3d(x,y,z0,mat)
lines(C,lty=2)
C=trans3d(x,y,z,mat)
lines(C,col="blue")}

Image may be NSFW.
Clik here to view.

We do have two parts here: the linear increase of the average, Image may be NSFW.
Clik here to view. and the constant variance of the normal distribution Image may be NSFW.
Clik here to view..

On the other hand, if we assume a Poisson regression,

poisson.reg = glm(dist~speed,data=cars,family=poisson(link="log"))

we have something like

Image may be NSFW.
Clik here to view.

This time, two things have changed simultaneously: our model is no longer linear, it is an exponential one Image may be NSFW.
Clik here to view., and the variance is also increasing with the explanatory variable Image may be NSFW.
Clik here to view., since with a Poisson regression,

Image may be NSFW.
Clik here to view.

If we adapt the previous code, we get

Image may be NSFW.
Clik here to view.

The problem is that we changed two things when we introduced the Poisson regression from the linear model. So let us look at what happens when we change the two components independently. First, we can change the link function, with a Gaussian model but this time a multiplicative model (with a logarithm link function)

gaussian.reg = glm(dist~speed,data=cars,family=gaussian(link="log"))

Image may be NSFW.
Clik here to view.

which is still, here, an homoscedasctic model, but this time non-linear. Or we can change the link function in the Poisson regression, to get a linear model, but heteroscedastic

poisson.lin = glm(dist~speed,data=cars,family=poisson(link="identity"))

Image may be NSFW.
Clik here to view.

So this is basically what GLMs are about….

↧

Trending Articles

Vimeo 10.7.0 by Vimeo.com, Inc.

March 20, 2024, 9:50 pm

Dibujos para colorear de perros

September 11, 2013, 6:49 am

Long Distance Relationship Tagalog Love Quotes

February 19, 2014, 1:59 am

Kahit may Toyo ka

April 6, 2021, 7:03 pm

Re:Mutton Pies (lleechef)

March 1, 2015, 4:30 am

FORECLOSURE OF REAL ESTATE MORTGAGE

July 8, 2012, 6:30 am

Re: lwIP PIC32 port - new title : CycloneTCP a new open source stack for...

July 13, 2020, 10:33 am

Ka dewlynnong Nongkhnum, ka jaka ba itynnat tam ha West Khasi Hills

November 8, 2018, 8:10 pm

Vimeo 10.7.1 by Vimeo.com, Inc.

April 10, 2024, 9:16 am

Pahiyas 2013 sa Lucban, Quezon

May 16, 2013, 9:01 am

Autor: Adam

January 26, 2012, 6:34 am

Renos para colorear

December 22, 2013, 5:29 am

Two timer Sad tagalog Love quotes

February 22, 2014, 1:15 am

RE: Mutton Pies (mely)

February 4, 2009, 12:00 pm

Mga Tala sa “Unang Siglo ng Nobela sa Filipinas” (2009) ni Virgilio S. Almario

May 15, 2013, 9:01 am

Re: lwIP PIC32 port - new title : CycloneTCP a new open source stack for...

June 26, 2017, 12:50 am

Ka longiing longsem kaba skhem bad kaba khlain ka pynlong kein ia ka...

October 28, 2018, 9:42 pm

作者：人与自然和谐相处的世外桃源 - 加拉帕戈斯群岛 Galapagos Islands 游记 - 美国信用卡指南

May 26, 2018, 8:32 am

Vimeo 10.12.2 by Vimeo.com, Inc.

October 12, 2024, 8:51 am

PREMATURE CAMPAIGNING – Meron ba nun?

November 2, 2012, 7:48 pm

© 2025 //www.rssing.com