predict risk (copy)

PREDICTING RISK FROM FINANCIAL REPORTS WITH REGRESSION

In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 272–280

Authors: Kogan, S., Levin, D., Routledge, B.R., Sagi, J.S., and Smith, N.A.

Presenter: Chen QingZhi

Date: 2012-03-15

OUTLINE

1. Introduction

2. Stock Return Volatility

3. Problem Formulation

4. Dataset

5. Baselines and Evaluation Method

6. Experiments

7. Conclusion

INTRODUCTION

4. Our motivation and task:

1) In the real world, people rely on experience when they use a company's financial reports to judge the financial risk of investing in it.

2) Given a set of financial reports, we try to automatically predict a continuous quantity known as stock return volatility, which is a measure of financial risk.

5. The output variable in this work is uncontroversial, and the resources (both the text and the volatility records) are easy to obtain.

STOCK RETURN VOLATILITY

3. Why volatility? We are trying to predict how stable a company's stock price will be over a future time period, specifically one year.

4. Volatility is easier to predict than stock performance, and measuring it is not subject to human expertise or disagreement.
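A minimal sketch of how return volatility might be computed from a price series, assuming the common definition as the sample standard deviation of simple daily returns over the period. The function name `return_volatility` and the sample prices are illustrative and do not come from the paper.

```python
import math

def return_volatility(prices):
    """Sample standard deviation of simple daily returns (assumed definition)."""
    # Simple return: r_t = P_t / P_{t-1} - 1
    returns = [prices[i] / prices[i - 1] - 1 for i in range(1, len(prices))]
    if len(returns) < 2:
        raise ValueError("need at least three prices")
    mean = sum(returns) / len(returns)
    variance = sum((r - mean) ** 2 for r in returns) / (len(returns) - 1)
    return math.sqrt(variance)

# Made-up price series: one steady, one with large swings.
stable = [100.0, 100.5, 100.2, 100.6, 100.4]
jumpy = [100.0, 110.0, 95.0, 108.0, 90.0]
print(return_volatility(stable) < return_volatility(jumpy))
```

A stock whose price swings widely gets a higher value than one whose price barely moves, which is the sense in which volatility measures risk.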

PROBLEM FORMULATION

3. Support vector regression (SVR) is a well-known method for training a regression model. In its objective, C is a regularization constant that controls how heavily training error is penalized.
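The role of C can be illustrated with the standard linear SVR objective using the epsilon-insensitive loss. This is a sketch under assumptions: the exact formula from the slide is not preserved here, the scaling of the loss term by 1/N is one common convention, and the name `svr_objective` and the toy data are illustrative.

```python
def svr_objective(w, X, y, C, eps):
    """0.5 * ||w||^2 + (C / N) * sum of epsilon-insensitive losses."""
    regularizer = 0.5 * sum(wj * wj for wj in w)
    loss = 0.0
    for x, target in zip(X, y):
        prediction = sum(wj * xj for wj, xj in zip(w, x))
        # Errors smaller than eps cost nothing; larger ones cost linearly.
        loss += max(0.0, abs(target - prediction) - eps)
    return regularizer + C / len(X) * loss

# Toy data: two documents with two features each (made-up values).
X = [[1.0, 0.0], [0.0, 1.0]]
y = [1.0, 2.0]
w = [1.0, 1.0]
print(svr_objective(w, X, y, C=2.0, eps=0.1))
```

Raising C makes training errors more expensive relative to the norm of w, so the learned model fits the training data more closely.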

PROBLEM FORMULATION

In terms of the kernel function:

This formula is then used to solve for w.
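As a sketch of the relationship between the kernel form and the weight vector, assume a linear kernel, so that the prediction is a weighted sum of kernel values against the training documents and w is the same weighted sum of the training vectors. The slide's own formulas are not preserved, so the function names, the dual coefficients `alphas`, and the vectors below are all illustrative.

```python
def linear_kernel(a, b):
    """K(a, b) = a . b"""
    return sum(ai * bi for ai, bi in zip(a, b))

def weights_from_dual(alphas, train_vectors):
    """w = sum_i alpha_i * h(d_i), valid for the linear kernel."""
    w = [0.0] * len(train_vectors[0])
    for alpha, vec in zip(alphas, train_vectors):
        for j, value in enumerate(vec):
            w[j] += alpha * value
    return w

def predict_dual(alphas, train_vectors, doc):
    """f(d) = sum_i alpha_i * K(d, d_i)"""
    return sum(a * linear_kernel(doc, v) for a, v in zip(alphas, train_vectors))

train_vectors = [[1.0, 0.0, 2.0], [0.0, 3.0, 1.0]]
alphas = [0.5, -0.25]  # illustrative values, not learned
doc = [2.0, 1.0, 1.0]

w = weights_from_dual(alphas, train_vectors)
primal = sum(wj * xj for wj, xj in zip(w, doc))
print(primal, predict_dual(alphas, train_vectors, doc))
```

With a linear kernel both routes give the same prediction, and recovering w explicitly is what makes the per-term weights inspectable afterwards.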

DATASET

1. Form 10-K: an annual report mandated by the US Securities and Exchange Commission.

2. Subsection 7A: quantitative and qualitative disclosures about market risk. We filter out the other sections of each report and keep this most important part.

3. For various reasons, not all of the documents pass the filter: bankruptcy, delisting …

DATASET

Table 1: Dimensions of the dataset used in this paper, after filtering and tokenization.

The near doubling in average document size during 2002–3 is possibly due to the passage of the Sarbanes-Oxley Act of 2002 in the wake of Enron’s accounting scandal (and numerous others).

DATASET

REPORT SAMPLE:

The following discussion and analysis of ABC’s consolidated financial condition and consolidated results of operation should be read in conjunction with ABC’s Consolidated Financial Statements and Notes thereto included elsewhere herein. This discussion contains certain forward-looking statements which involve risks and uncertainties. ABC’s actual results could differ materially from the results expressed in, or implied by, such statements. See “Regarding Forward-Looking Statements.”

DATASET

Data preparation:

Tokenization was applied to the text, including punctuation removal, downcasing, collapsing all digit sequences, and heuristic removal of remnant markup.
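The preparation steps above can be sketched roughly as follows. This is an illustrative approximation, not the authors' actual pipeline: the regular expressions, the function name `tokenize`, and the sample sentence are assumptions. Digit sequences are collapsed to "#", following the convention mentioned in the caption of Table 3.

```python
import re

def tokenize(text):
    """Rough approximation of the described preprocessing steps."""
    text = re.sub(r"<[^>]+>", " ", text)    # heuristic removal of markup tags
    text = text.lower()                     # downcasing
    text = re.sub(r"\d+", "#", text)        # collapse each digit sequence to "#"
    text = re.sub(r"[^a-z#\s]", " ", text)  # punctuation removal
    return text.split()

print(tokenize("<b>Revenue</b> rose 12% in 2002."))
```

Note that this simple version turns a number such as "1,234" into two "#" tokens; a fuller implementation would need to decide how to treat separators inside numbers.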

BASELINES AND EVALUATION METHOD

The evaluation measure is mean squared error (MSE):
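A minimal sketch of the measure, assuming that true and predicted volatilities are compared on a log scale; if plain volatility is intended, the logarithms can simply be dropped. The function name `mse_log_volatility` and the numbers are illustrative.

```python
import math

def mse_log_volatility(true_vols, predicted_vols):
    """Mean of squared differences between log true and log predicted values."""
    if len(true_vols) != len(predicted_vols) or not true_vols:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [
        (math.log(v) - math.log(p)) ** 2
        for v, p in zip(true_vols, predicted_vols)
    ]
    return sum(squared_errors) / len(squared_errors)

# Made-up volatilities for three companies.
true_vols = [0.020, 0.035, 0.050]
predicted_vols = [0.022, 0.030, 0.055]
print(mse_log_volatility(true_vols, predicted_vols))
```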

EXPERIMENTS

Objective representation:

EXPERIMENTS

RESULTS

Table 2: MSE (Eq. 6) of different models on test data predictions. Lower values are better. Boldface denotes improvements over the baseline, and ∗ denotes significance compared to the baseline under a permutation test (p < 0.05).

EFFECTS OF SARBANES-OXLEY

The Sarbanes-Oxley Act of 2002, which sought to reform financial reporting, had a clear effect on the informativeness of the reports.

RECENT DATA IS MORE IMPORTANT

INTERPRET WEIGHTS

Table 3: Most strongly-weighted terms in models learned from various time periods (LOG1P model with unigrams and bigrams). “#” denotes any digit sequence.
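To make the "LOG1P model with unigrams and bigrams" concrete, here is a small sketch that assumes LOG1P means each feature is log(1 + count) of a term in the document. The function name `log1p_features` and the token list are illustrative.

```python
import math
from collections import Counter

def log1p_features(tokens):
    """Map each unigram and bigram to log(1 + its count in the document)."""
    bigrams = [a + " " + b for a, b in zip(tokens, tokens[1:])]
    counts = Counter(tokens + bigrams)
    return {term: math.log1p(count) for term, count in counts.items()}

features = log1p_features(["net", "loss", "net", "loss"])
print(sorted(features))
```

The logarithm dampens very frequent terms, so a term that appears 100 times does not get 100 times the influence of one that appears once.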

EXPERIMENTS

For example, “estimates”, which averages one occurrence per document even in the 1996–2000 period, experiences the same term frequency explosion and goes through a similar weight change, from strongly indicating high volatility to strongly indicating low volatility.

CONCLUSION

1. The task serves as a testbed for NLP.

2. How much improvement does this paper present?

3. Interpretable weights can show how the positive or negative connotations of terms have changed.

4. The approach can compare the information released by different texts.
