統計学輪講(第15回)


日時	  2001年7月10日(火)    15時00分〜16時40分
場所	  経済学部5階視聴覚室
講演者    田邉 國士 (統計数理研究所)  
演題     Penalized Logistic Regression Machines

概要
Support Vector Machines(Vapnik,1979,'95,'98) have been recognized as 
powerful method for  learning certain structures from data for prediction. 
Their success is due to intrinsic combination of the quadratic programming 
models with {\it Kernel Method}. The machine, however, does not seem to 
accomodate very well the cases where the mechanism of generating data is 
largely of stochastic nature. By employing the penalized logistic regression 
model, we make a statistical attempt to construct multiclass discrimination 
machines  which can handle much noisier stochastic data to be competetive 
with SVM in such an environment. It is shown that {\it by penalizing the 
likelihood in a specific way, we can intrinsically combine the logistic 
regression model with the kernel methods.} In particular, {\it a new class 
of penalty functions and associated normalized projective kernels are 
introduced to gain a versatile induction power of our learning machines.}  
The closed formulas are given for the first and second derivetives of the 
log penalized logistic regression likelihood, whose Hessian matrix is shown 
to be positive definite and uniformly bounded.  {\it Dual classes of globally 
convergent} learning machines(algorithms) are given for obtaining the optimal 
parameters for both probabilistic and deterministic prediction. Analysis  of 
the rate of  convergence is given for each class of machines. The type-I 
(or marginal) likelihood and Generalized Information Criteria are also given 
in closed form for determining the optimal value of hyperparameters in the 
model so that the machines have a {\it due induction capacity to the size and
the quality of an available training data set}.

Key words: Prediction, Multiclass Discrimination, Penalized Logistic 
Regression, Neural Network, Kernel Method, Normalized Projective Kernel,  
Dual Learning Machines, Induction Capacity, Type-{\rm II} Likelihood, 
Marginal Likelihood, Generalized Information Criterion


統計学輪講のスケジュールに戻る.