Joy Zhang

Joy Zhang · 2026-05-03T16:47:37.296Z

At the Berkshire Hathaway Annual Shareholder Meeting, CEO Greg Abel put it plainly: "We're not going to do AI for the sake of AI... it has to be additive to our businesses." "Those large language models... they're large logic models. At this point in time, we're using it to solve logical challenges in our business, and what we're trying to do is in a more efficient fashion. Do it more quickly and get to a better answer." He described large language models as "large logic models" — tools for solving real logical challenges faster and more accurately. No hype. Just results. That's exactly the philosophy driving AI at GEICO. We're not chasing trends. We're using GenAI to fundamentally transform how insurance works — making smarter decisions, moving faster, and delivering better outcomes for our customers. If you want to build AI that actually matters — with real-world impact at massive scale — come build it with us. #GEICO #GenerativeAI #AIInInsurance #BerkshireHathaway #NowHiring #MachineLearning

United States
9K followers 500+ connections

View mutual connections with Joy

Joy can introduce you to 10+ people at GEICO

Email or phone

Password

Forgot password?

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Join to view profile

GEICO

Carnegie Mellon University

Articles by Joy

Why AI Skills Threatened SaaS? - The DoorDash vs. Blue Apron Analogy

Feb 8, 2026

Why AI Skills Threatened SaaS? - The DoorDash vs. Blue Apron Analogy

The DoorDash vs. Blue Apron Analogy Traditional SaaS = DoorDash 🚗 You want dinner, you order, someone else does…

9 Comments

Activity

I think it is amazing how we have evolved into a female led organization. GEICO continues to blaze the path in the financial sector. The…

I think it is amazing how we have evolved into a female led organization. GEICO continues to blaze the path in the financial sector. The…

Liked by Joy Zhang
So blessed to have spent a couple of days in DC last week with my GEICO peeps. Thankful for our wonderful leadership team who continues to pour…

So blessed to have spent a couple of days in DC last week with my GEICO peeps. Thankful for our wonderful leadership team who continues to pour…

Liked by Joy Zhang
At the Berkshire Hathaway Annual Shareholder Meeting, CEO Greg Abel put it plainly: "We're not going to do AI for the sake of AI... it has to be…

At the Berkshire Hathaway Annual Shareholder Meeting, CEO Greg Abel put it plainly: "We're not going to do AI for the sake of AI... it has to be…

Posted by Joy Zhang

Join now to see all activity

Experience

GEICO
-
-
-
-
-
-

Menlo Park, CA
-

Silicon Valley
-
-
-

Education

Carnegie Mellon University

-

2000 - 2007
-

1996 - 1999
-

1992 - 1996
-

1986 - 1992

Licenses & Certifications

Instrument Rating Pilot License

Federal Aviation Administration

Issued Nov 2020
Private Pilot License

Federal Aviation Administration

Issued Aug 2014

Volunteer Experience

Coach

Palo Alto Little League

Mar 2021 - Jun 2021 4 months

Children

Coach of Palo Alto Little League T-ball team Rockies
Team Manager

STANFORD SOCCER CLUB

Aug 2021 - Present 4 years 10 months

Children

Co-manage the soccer team

Publications

KeySens: Passive User Authentication through Micro-behavior Modeling of Soft Keyboard Interaction

The Proceedings of MobiCASE 2013, Fifth International Conference on Mobile Computing, Applications and Services November 8, 2013
Other authors
See publication
Privacy-preserved Mobile Sensing through Hybrid Cloud Trust Framework

IEEE 6th International Conference on Cloud Computing May 6, 2013
Other authors
See publication

Patents

Systems and methods for training a machine learning model for a second language based on a machine learning model for a first language

Issued June 1, 2023 US20230169388A1
Systems, methods, and non-transitory computer readable media can train a machine learning model for a first language to determine a classification for a content item in the first language. Machine translation can be performed to generate respective machine translations of a plurality of content items in a second language into the first language. Respective classifications for the plurality of content items in the second language can be determined based on the machine translations of the…

Systems, methods, and non-transitory computer readable media can train a machine learning model for a first language to determine a classification for a content item in the first language. Machine translation can be performed to generate respective machine translations of a plurality of content items in a second language into the first language. Respective classifications for the plurality of content items in the second language can be determined based on the machine translations of the plurality of content items in the second language and the machine learning model for the first language. Training data in the second language can be automatically generated, where the training data in the second language includes the plurality of content items in the second language and the respective classifications.

Other inventors
Labeling video files using acoustic vectors

Issued June 28, 2022 11,372,917
In one embodiment, a method includes receiving a video file. The video file includes a corresponding audio stream. The method further includes accessing the audio stream, and generating, based on the audio stream, a representative vector. The vector has a particular number of dimensions. The method further includes accessing a label-embedding space, which has the same particular number of dimensions, and includes a number of regions that each correspond to a respective label. The method further…

In one embodiment, a method includes receiving a video file. The video file includes a corresponding audio stream. The method further includes accessing the audio stream, and generating, based on the audio stream, a representative vector. The vector has a particular number of dimensions. The method further includes accessing a label-embedding space, which has the same particular number of dimensions, and includes a number of regions that each correspond to a respective label. The method further includes determining a region of the label-embedding space that corresponds to the vector, the determined region corresponding to a particular label. The method further includes associating the particular label with the video file.

Other inventors
See patent
Systems and methods for scraping URLs based on viewport views

Issued December 7, 2021 US 11,195,106

Systems, methods, and non-transitory computer readable media are configured to receive a uniform resource locator. A time and one or more features associated with the uniform resource locator can be provided to a first machine learning model. A prediction relating to a quantity of views the uniform resource locator achieves by the time can be received from the first machine learning model.
Post topic classification

Issued October 12, 2021 US 11,144,826
In one embodiment, a method includes accessing an input vector representing an input post, wherein: the vector space comprises clusters each associated with a topic; each cluster was determined based on a clustering of training-page vectors corresponding to training pages that each comprise training posts, each training post submitted by a user to a training page and comprises content selected by the user; and each training-page vector was generated by an ANN that was trained, based on the…

In one embodiment, a method includes accessing an input vector representing an input post, wherein: the vector space comprises clusters each associated with a topic; each cluster was determined based on a clustering of training-page vectors corresponding to training pages that each comprise training posts, each training post submitted by a user to a training page and comprises content selected by the user; and each training-page vector was generated by an ANN that was trained, based on the training posts of training pages associated with the ANN, to receive a post and then output a probability that the received post is related to the training posts of the training pages; determining that the input vector is located within a particular cluster in the vector space; and determining a topic of the input post based on the topic associated with the particular cluster that the input vector is located within.

Other inventors
See patent
Social hash for language models

Issued January 26, 2021 US 10,902,221
Components of language processing engines, such as translation models and language models, can be customized for groups of users or based on user type values. Users can be organized into groups or assigned a value on a continuum based on factors such as interests, biographical characteristics, social media interactions, etc. In some implementations, translation engine components can be customized for groups of users by selecting the training data from content created by users in that group. In…

Components of language processing engines, such as translation models and language models, can be customized for groups of users or based on user type values. Users can be organized into groups or assigned a value on a continuum based on factors such as interests, biographical characteristics, social media interactions, etc. In some implementations, translation engine components can be customized for groups of users by selecting the training data from content created by users in that group. In some implementations, the group identifier or continuum value can be part of the input into a general translation component allowing the translation component to take a language style of that user group into account when performing language processing tasks.

Other inventors
See patent
Automatic personalized story generation for visual media

Issued October 13, 2020 US 10664664

Exemplary embodiments relate to the automatic generation of captions for visual media, including photos, photo albums, non-live video, and live video. The visual media may be analyzed to determine contextual information (such as location information, people and objects in the video, time, etc.). A system may integrate this information with information from the user's social network and a personalized language model built using public-facing language from the user. The personalized language…

Exemplary embodiments relate to the automatic generation of captions for visual media, including photos, photo albums, non-live video, and live video. The visual media may be analyzed to determine contextual information (such as location information, people and objects in the video, time, etc.). A system may integrate this information with information from the user's social network and a personalized language model built using public-facing language from the user. The personalized language model captures the user's way of speaking to make the generated captions more detailed and personalized. The language model may account for the context in which the video was generated. The captions maybe used to simplify and encourage content generation, and may also be used to index visual media, rank the media, and recommend the media to users likely to engage with the media.

See patent
Extracting questions and answers

Issued September 1, 2020 US 10,762,438
A system for answering user questions can provide answers from a knowledge base that stores question/answer pairs. These pairs can be associated with characteristics of the asking user so that, when subsequent users ask similar questions, answers can be selected that have been identified as most relevant to that type of user. The question/answer pairs in the knowledge base can be identified from social media posts where the original post contains a question and one or more comments on the post…

A system for answering user questions can provide answers from a knowledge base that stores question/answer pairs. These pairs can be associated with characteristics of the asking user so that, when subsequent users ask similar questions, answers can be selected that have been identified as most relevant to that type of user. The question/answer pairs in the knowledge base can be identified from social media posts where the original post contains a question and one or more comments on the post provide an answer. Posts can be identified as containing a question using a question classification model. A post comment can be identified as an answer based on: whether the question poster responded positively to the comment, whether the comment has similar keywords to the question, whether the comment has the characteristics of an answer, and how often a similar answer has been provided for similar questions.

Other inventors
See patent
User clustering in a latent space for identifying user interest in a content item

Issued August 11, 2020 US 10,740,825
An online system targets users with a candidate content item. The online system generates a user embedding for each of a plurality of users by identifying content items interacted with by the user, identifying one or more keywords within the content items, identifying word embeddings for each of the identified keywords, and generating the user embedding with the word embeddings. The online system clusters the user embeddings while generating a cluster embedding for each cluster. The online…

An online system targets users with a candidate content item. The online system generates a user embedding for each of a plurality of users by identifying content items interacted with by the user, identifying one or more keywords within the content items, identifying word embeddings for each of the identified keywords, and generating the user embedding with the word embeddings. The online system clusters the user embeddings while generating a cluster embedding for each cluster. The online system generates a targeting embedding for the candidate content item. Then the online system generates a score for each cluster based on a comparison of the targeting embedding with each cluster embedding. From the generated scores, the online system ranks and selects some clusters for presentation of the candidate content item.

Other inventors
See patent
Crowdsourced chatbot answers

Issued June 23, 2020 US 10,692,006

A chatbot can use a knowledge base including question/answer pairs to respond to questions. When a question is asked that does not correspond to a question/answer pair in the knowledge base, the chatbot can send the question to one or more humans to obtain an answer. However, only some people will have the experience, context, knowledge, etc., to answer the question. A model can be trained to select "experts" that are likely to be able to provide a good answer to a question by using both A) a…

A chatbot can use a knowledge base including question/answer pairs to respond to questions. When a question is asked that does not correspond to a question/answer pair in the knowledge base, the chatbot can send the question to one or more humans to obtain an answer. However, only some people will have the experience, context, knowledge, etc., to answer the question. A model can be trained to select "experts" that are likely to be able to provide a good answer to a question by using both A) a vector comprising characteristics of questions and of the person posing the questions and B) a vector comprising characteristics of a possible expert. The model can trained to produce a value predicting how good an identified expert's answer is likely to be. The model can be trained based on measures of past answers provided for types of questions/questioners.

See patent
Systems and methods for training machine learning models for language clusters

Issued June 16, 2020 US 10,685,188
Systems, methods, and non-transitory computer readable media can generate a plurality of language clusters based on one or more of: language similarity between languages or social behavior similarity between languages. A representative language for a language cluster of the plurality of language clusters can be determined. For the language cluster of the plurality of language clusters, a machine learning model can be trained based on the representative language for the language cluster to…

Systems, methods, and non-transitory computer readable media can generate a plurality of language clusters based on one or more of: language similarity between languages or social behavior similarity between languages. A representative language for a language cluster of the plurality of language clusters can be determined. For the language cluster of the plurality of language clusters, a machine learning model can be trained based on the representative language for the language cluster to classify content items in languages included in the language cluster.

Other inventors
See patent
Language-agnostic understanding

Issued May 19, 2020 US 10,657,332
Exemplary embodiments relate to techniques to classify or detect the intent of content written in a language for which a classifier does not exist. These techniques involve building a code-switching corpus via machine translation, generating a universal embedding for words in the code-switching corpus, training a classifier on the universal embeddings to generate an embedding mapping/table; accessing new content written in a language for which a specific classifier may not exist, and mapping…

Exemplary embodiments relate to techniques to classify or detect the intent of content written in a language for which a classifier does not exist. These techniques involve building a code-switching corpus via machine translation, generating a universal embedding for words in the code-switching corpus, training a classifier on the universal embeddings to generate an embedding mapping/table; accessing new content written in a language for which a specific classifier may not exist, and mapping entries in the embedding mapping/table to the universal embeddings. Using these techniques, a classifier can be applied to the universal embedding without needing to be trained on a particular language. Exemplary embodiments may be applied to recognize similarities in two content items, make recommendations, find similar documents, perform deduplication, and perform topic tagging for stories in foreign languages.

Other inventors
See patent
Deep translations

Issued March 10, 2020 US US10586168B2
Other inventors
Incorporation of user-provided natural language translations in a social networking system

Issued January 7, 2020 US US10528677B1
Other inventors
Associating a user identity with a mobile device identity

Issued July 16, 2019 US 10,354,145
A system includes, in one aspect, one or more processing devices that perform operations comprising: detecting one or more human objects in images captured by a visual image recording device; obtaining a motion time series for each of the detected one or more human objects using the captured images; obtaining a received signal strength (RSS) time series for each of the one or more mobile devices, based on received RF signals from the one or more mobile devices; and generating an association…

A system includes, in one aspect, one or more processing devices that perform operations comprising: detecting one or more human objects in images captured by a visual image recording device; obtaining a motion time series for each of the detected one or more human objects using the captured images; obtaining a received signal strength (RSS) time series for each of the one or more mobile devices, based on received RF signals from the one or more mobile devices; and generating an association between (i) identifying data for a first mobile device of the one or more mobile devices, and (ii) identifying data for one of the one or more human objects representing a first human, wherein the first mobile device has an RSS time series that fluctuates at a time period corresponding to movement in the obtained motion time series for the one of the one or more human objects representing the first human.

Other inventors
See patent
Crowd matching translators

Issued April 9, 2019 US US10255277B2
Exemplary embodiments relate to techniques for selecting translators willing to provide high-quality translations for a cause, organization, or individual. Users having a high level of engagement with the cause, organization, or individual may be identified as translator candidates. For example, the user may actively engage with the organization or individual on social media, or may be interested in the topics discussed in the source document. The translators may be evaluated based on the…

Exemplary embodiments relate to techniques for selecting translators willing to provide high-quality translations for a cause, organization, or individual. Users having a high level of engagement with the cause, organization, or individual may be identified as translator candidates. For example, the user may actively engage with the organization or individual on social media, or may be interested in the topics discussed in the source document. The translators may be evaluated based on the quality of their previous translations and their level of engagement/interest. The translator candidates may be directly connected with the originator of the request to translate the document. Because exemplary embodiments select highly engaged users to translate the source document, the resulting translation is likely to be of higher quality, and produced at a lower cost, than a translation by a non-engaged user, and user participation and awareness of a cause, organization, or individual may be increased.

Other inventors
See patent
Providing translations of electronic messages via a social networking system

Issued November 6, 2018 US 20160191448A1

See patent
Optimizing machine translations for user engagement

Issued October 30, 2018 US 10114819B2

See patent
Language model personalization

Issued June 19, 2018 US 20170185583

See patent
Language independent representations

Filed June 5, 2018 US 20170103062A1

Snippets can be represented in a language-independent semantic manner. Each portion of a snippet can be represented by a combination of a semantic representation and a syntactic representation, each in its own dimensional space. A snippet can be divided into portions by constructing a dependency structure based on relationships between words and phrases. Leaf nodes of the dependency structure can be assigned: A) a semantic representation according to pre-defined word mappings and B) a syntactic…

Snippets can be represented in a language-independent semantic manner. Each portion of a snippet can be represented by a combination of a semantic representation and a syntactic representation, each in its own dimensional space. A snippet can be divided into portions by constructing a dependency structure based on relationships between words and phrases. Leaf nodes of the dependency structure can be assigned: A) a semantic representation according to pre-defined word mappings and B) a syntactic representation according to the grammatical use of the word. A trained semantic model can assign to each non-leaf node of the dependency structure a semantic representation based on a combination of the semantic and syntactic representations of the corresponding lower-level nodes. A trained syntactic model can assign to each non-leaf node a syntactic representation based on a combination of the syntactic representations of the corresponding lower-level nodes and the semantic representation of that node.

See patent
User feedback for low-confidence translations

Issued March 20, 2018 US 9,922,029

See patent
Incorporation of user-provided natural language translations in a social networking system

Issued February 20, 2018 US 9,898,461

See patent
Mining multi-lingual data

Issued January 9, 2018 US 9,864,744

See patent
Determining trending topics in social media

Issued November 28, 2017 US 9,830,386

See patent
Multilingual Business Intelligence for Actions

Issued November 28, 2017 US 201,601,886,61A1

See patent
Analyzing language dependency structures

Issued November 28, 2017 US 9,830,404

See patent
Contrastive multilingual business intelligence

Issued November 28, 2017 US 20160188703A1

See patent
Travel recommendations on online social networks

Issued November 14, 2017 US 9,820,094

See patent
Predicting future translations

Issued October 31, 2017 US 9,805,029

See patent
Predicting future translations

Issued August 19, 2017 US 9,747,283

See patent

Languages

Chinese

-

More activity by Joy

Thank you to everyone for joining yesterday's AI Discussion Club walk to discuss the story of NVIDIA and Jensen Huang. Our next event will be on May…

Thank you to everyone for joining yesterday's AI Discussion Club walk to discuss the story of NVIDIA and Jensen Huang. Our next event will be on May…

Liked by Joy Zhang
This indeed was one of the best! Come join us for unbelievably exciting journey!

This indeed was one of the best! Come join us for unbelievably exciting journey!

Liked by Joy Zhang
Outstanding team and outstanding team effort. A pleasure to work with the best!

Outstanding team and outstanding team effort. A pleasure to work with the best!

Liked by Joy Zhang
BH Annual Shareholders Meeting: Greg Abel, CEO of Berkshire Hathaway and succcessor of Warren Buffett recognizing the technology transformation we…

BH Annual Shareholders Meeting: Greg Abel, CEO of Berkshire Hathaway and succcessor of Warren Buffett recognizing the technology transformation we…

Liked by Joy Zhang
An autonomous robot for picking up waste in waters. This is what we needed robots and AI for, not for painting pictures and writing songs.

An autonomous robot for picking up waste in waters. This is what we needed robots and AI for, not for painting pictures and writing songs.

Liked by Joy Zhang
The AI data flywheel is at the core of building safe, reliable autonomy in complex environments. It’s a continuous learning loop: real-world…

The AI data flywheel is at the core of building safe, reliable autonomy in complex environments. It’s a continuous learning loop: real-world…

Liked by Joy Zhang
5 Years and 1 Shared Vision This spring, I’m celebrating a special milestone. Five years ago in April, I stepped into the co-CEO role at Waymo…

5 Years and 1 Shared Vision This spring, I’m celebrating a special milestone. Five years ago in April, I stepped into the co-CEO role at Waymo…

Liked by Joy Zhang
Quarter pound breakfast burritos to celebrate the team that just pounded the quarter! Team LangChain 🐐

Quarter pound breakfast burritos to celebrate the team that just pounded the quarter! Team LangChain 🐐

Liked by Joy Zhang

View Joy’s full profile

See who you know in common
Get introduced
Contact Joy directly

Join to view full profile

Other similar profiles

Abhimanu Kumar

Abhimanu Kumar

San Francisco Bay Area

Connect
Jeong-Yoon Lee

Jeong-Yoon Lee

Los Angeles Metropolitan Area

Connect
Alex Jaimes, Ph.D.

Alex Jaimes, Ph.D.

New York City Metropolitan Area

Connect
Tanvi Motwani

Tanvi Motwani

Mountain View, CA

Connect
Yael Garten

Yael Garten

San Francisco Bay Area

Connect
Pilar Manchón PhD.

Pilar Manchón PhD.

San Francisco Bay Area

Connect
Lukas Kidzinski

Lukas Kidzinski

Menlo Park, CA

Connect
Mohammad Shokoohi-Yekta

Mohammad Shokoohi-Yekta

Redmond, WA

Connect
Pieter De Leenheer

Pieter De Leenheer

New York, NY

Connect
Massimiliano Versace

Massimiliano Versace

Boston, MA

Connect
Estevam Hruschka

Estevam Hruschka

Mountain View, CA

Connect
Guy Lebanon

Guy Lebanon

San Francisco Bay Area

Connect
Kumar Chellapilla

Kumar Chellapilla

Mountain View, CA

Connect
Bo (Rambo) Tan

Bo (Rambo) Tan

San Francisco Bay Area

Connect
Toufi Saliba

Toufi Saliba

San Francisco, CA

Connect
Jay Bartot

Jay Bartot

Seattle, WA

Connect
Scott Clark

Scott Clark

Palo Alto, CA

Connect
Rohit Prasad

Rohit Prasad

Lexington, MA

Connect
Neal Sales-Griffin

Neal Sales-Griffin

Chicago, IL

Connect
Reid Hoffman

Reid Hoffman

United States

Connect

Explore more posts

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses

See all courses

Joy Zhang

United States 9K followers 500+ connections

Articles by Joy

Why AI Skills Threatened SaaS? - The DoorDash vs. Blue Apron Analogy

Activity

I think it is amazing how we have evolved into a female led organization. GEICO continues to blaze the path in the financial sector. The…

Liked by Joy Zhang

So blessed to have spent a couple of days in DC last week with my GEICO peeps. Thankful for our wonderful leadership team who continues to pour…

Liked by Joy Zhang

At the Berkshire Hathaway Annual Shareholder Meeting, CEO Greg Abel put it plainly: "We're not going to do AI for the sake of AI... it has to be…

Posted by Joy Zhang

Experience

-

-

-

-

-

-

-

-

-

-

Education

-

-

-

-

Licenses & Certifications

Instrument Rating Pilot License

Private Pilot License

Volunteer Experience

Coach

Team Manager

STANFORD SOCCER CLUB

Publications

The Proceedings of MobiCASE 2013, Fifth International Conference on Mobile Computing, Applications and Services November 8, 2013

IEEE 6th International Conference on Cloud Computing May 6, 2013

Patents

Systems and methods for training a machine learning model for a second language based on a machine learning model for a first language

Issued June 1, 2023 US20230169388A1

Issued June 28, 2022 11,372,917

Systems and methods for scraping URLs based on viewport views

Issued December 7, 2021 US 11,195,106

Issued October 12, 2021 US 11,144,826

Issued January 26, 2021 US 10,902,221

Issued October 13, 2020 US 10664664

Issued September 1, 2020 US 10,762,438

Issued August 11, 2020 US 10,740,825

Issued June 23, 2020 US 10,692,006

Issued June 16, 2020 US 10,685,188

Issued May 19, 2020 US 10,657,332

Deep translations

Issued March 10, 2020 US US10586168B2

Incorporation of user-provided natural language translations in a social networking system

Issued January 7, 2020 US US10528677B1

Issued July 16, 2019 US 10,354,145

Issued April 9, 2019 US US10255277B2

Issued November 6, 2018 US 20160191448A1

Issued October 30, 2018 US 10114819B2

Issued June 19, 2018 US 20170185583

Filed June 5, 2018 US 20170103062A1

Issued March 20, 2018 US 9,922,029

Issued February 20, 2018 US 9,898,461

Issued January 9, 2018 US 9,864,744

Issued November 28, 2017 US 9,830,386

Issued November 28, 2017 US 201,601,886,61A1

Issued November 28, 2017 US 9,830,404

Issued November 28, 2017 US 20160188703A1

Issued November 14, 2017 US 9,820,094

Issued October 31, 2017 US 9,805,029

Issued August 19, 2017 US 9,747,283

Languages

Chinese

-

More activity by Joy

Thank you to everyone for joining yesterday's AI Discussion Club walk to discuss the story of NVIDIA and Jensen Huang. Our next event will be on May…

Liked by Joy Zhang

This indeed was one of the best! Come join us for unbelievably exciting journey!

Liked by Joy Zhang

Outstanding team and outstanding team effort. A pleasure to work with the best!

United States
9K followers 500+ connections