My Account Log in

2 options

Towards more automated statistical inference and machine learning / Matteo Sordello.

Connect to full text Available online

View online

Dissertations & Theses @ University of Pennsylvania Available online

View online
Format:
Book
Thesis/Dissertation
Author/Creator:
Sordello, Matteo, author.
Contributor:
Su, Weijie, degree supervisor.
University of Pennsylvania. Department of Statistics, degree granting institution.
Language:
English
Subjects (All):
Statistics.
Artificial intelligence.
Statistics--Penn dissertations.
Penn dissertations--Statistics.
Local Subjects:
Statistics.
Artificial intelligence.
Statistics--Penn dissertations.
Penn dissertations--Statistics.
Genre:
Academic theses.
Physical Description:
1 online resource (137 pages)
Contained In:
Dissertations Abstracts International 82-12B.
Place of Publication:
[Philadelphia, Pennsylvania] : University of Pennsylvania ; Ann Arbor : ProQuest Dissertations & Theses, 2021.
Language Note:
English
System Details:
Mode of access: World Wide Web.
text file
Summary:
While the amount of data that we are able to collect keeps growing, the use of Machine Learning and statistical models has become more computationally demanding. Moreover, especially in industry applications, these models need to be trained quickly and efficiently, while also being updated frequently. With this increased complexity comes the necessity of finding ways to make the creation and training of models as automated as possible. In the first part of this thesis, we develop two methods to adaptively tune the learning rate of iterative methods to perform optimization and sampling. While the two settings are substantially different, a similar underlying idea related to stationarity detection of the updates can be used to gain information about the current state of the system. The understanding of when (approximate) stationarity is achieved allows us to decay or increase the learning rate at the appropriate time, creating robust strategies that are not very sensitive to initial misspecification for this crucial parameter. In the second part of this thesis, we move our attention to Active Learning, a useful framework in which the burden of human annotation necessary to create the training data is reduced, since it is the model itself to indicate which training points are more likely to provide a better increase in its performance. Here, we improve the performance of a state-of-the-art method by including focused training in its training routine, allowing the model to select relevant points to use for the optimization phase instead of agnostically looping through all of them.
Notes:
Source: Dissertations Abstracts International, Volume: 82-12, Section: B.
Advisors: Su, Weijie; Committee members: James Johndrow; Dylan Small.
Department: Statistics.
Ph.D. University of Pennsylvania 2021.
Local Notes:
School code: 0175
ISBN:
9798505510971
Access Restriction:
Restricted for use by site license.
This item is not available from ProQuest Dissertations & Theses.
This item must not be sold to any third party vendors.

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account