Elvis S.

Elvis S. · 2026-05-15T17:16:01.210Z

// Is Grep All You Need? //

Pay attention to this, AI devs. (bookmark it)

They find that grep-style text search, when wrapped in the right agent harness, matches or beats embedding-based retrieval on coding-agent tasks.

Are vector databases even needed where this is all going? It might be that what coding agents needed was not better embeddings. It was a better harness design around primitive tools.

If you operate a coding-agent stack that depends on a vector DB, it might be time to re-evaluate.

My personal experience has been that agentic search, if done right, is more than good enough for a lot of use cases. But you also have to understand how to properly index and structure information for the agents to take advantage of it. At scale, vector databases do shine, so take that into account as well. A hybrid approach often works best, but that's something we haven't figured out well yet.

Belmopan, Belize

86K followers 500+ connections

DAIR.AI

National Tsing Hua University

About

Building DAIR.AI, wherein we are democratizing AI research, education, and technologies…

Articles by Elvis

OpenAI Introduces Operator & Agents

Jan 23, 2025

OpenAI Introduces Operator & Agents! Here is everything you need to know: Operator is a system that can use a web…

1 Comment

My Favorite LLM Papers for October

Oct 30, 2023

Here's a list of my favorite LLM papers I read this month: 1/ Zephyr LLM - a 7B parameter model with competitive…

2 Comments

Tracking LLMs with Comet

Aug 9, 2023

When building with LLMs, you will spend a lot of time optimizing prompts and diagnosing LLMs. As you put your solutions…

3 Comments

How To Build a Custom Chat LLM on Your Data

Jul 3, 2023

This is one of the fastest ways to build a custom ChatGPT-like system on top of your data. It's called ChatLLM (by…

2 Comments

Data Exploration with Chat Powered by GPT-4

Mar 30, 2023

As an ML Engineer, this is one of the most useful applications of GPT-4 I've seen. Chat Explore is a powerful…

6 Comments

Open Source Solution Replicates ChatGPT Training Process

Feb 21, 2023

ChatGPT is the biggest buzz in AI today! ChatGPT demonstrates remarkable capabilities so there is a high interest to…

7 Comments

New Conversational AI Tool Lets You “Chat” With Your Data

Feb 14, 2023

As an ML engineer, one area where I spend a lot of time is data engineering. Can we use conversational AI technologies…

8 Comments

Analyzing Worldwide Energy Production with Kibana Lens

Dec 23, 2019

While there are many tools that can be used to perform a quick analysis of large-scale data, data analysis in itself is…

1 Comment

XLNet outperforms BERT on several NLP Tasks

Jun 30, 2019

Two pretraining objectives that have been successful for pretraining neural networks used in transfer learning NLP are…

1 Comment

Activity

11h
Elvis S. shared this
The best way to learn AI is to build with agents. To help with that, we've launched hands-on labs and a new series on Agentic Engineering. First topic: Agent Skills. Next in the pipeline: planning, context engineering, multi-agent systems, long-running agents, and more. Go build! Full video in the comments.

2 Comments

1d
Elvis S. shared this
New VIDEO: From LLM Wikis to LLM Artifacts

Shared all my thoughts on why LLM wikis and HTML artifacts are a big deal. Plus, new tools to help you build wikis and artifacts with agents. Check the comments for links. Just getting started!

4 Comments

2d
Elvis S. shared this
I highly recommend this. The Agentic Review is a new podcast from Qodo hosted by Itamar Friedman and Nnenna Ndukwe, and it's a great AI coding show that's neither hype nor doom. It's honest conversations about what shipping high-quality AI-generated code actually looks like. We need more of these conversations around AI agents.

As an AI engineer, I think about this stuff constantly. A few things make this show worth your time:

* It's a conversation about what good code means in the era of coding agents.
* The hosts actually push back on guests instead of doing softball interviews.
* The current guest lineup is strong: Dexter Horthy, Scott Hanselman, and Matthew Makai.

I work a lot on context engineering, so the Dexter Horthy episode resonated the most with me. His take on context engineering as one of the biggest moats right now matches exactly what I'm seeing in production. He talks about a five-month experiment where his team stopped reading the code, then ripped it all out and rebuilt it by hand. That lesson about owning your context and actually reading what your agents produce is something every AI engineer needs to understand today.

The bigger thesis across episodes: typing code may be dying, but the SDLC, code review, and craft matter more than ever. AI sprinkled on a broken software lifecycle is "a band-aid on cancer" (Hanselman's line, and it stuck with me).

Thanks, Qodo, for the partnership on this post.

6 Comments

3d
Elvis S. shared this
Very interesting results from this NanoGPT-Bench eval. There is so much talk about self-improving agents. But can coding agents do real AI R&D? @IntologyAI reports that Codex, Claude Code, and Autoresearch recover only 9.3% of human progress. Coding agents spend most of their compute on hyperparameter tuning. In fact, coding agents rarely attempt algorithmic research at all. Claude Code and Autoresearch both reason more about algorithmic research, but still dodge implementation. Read more here: https://lnkd.in/e8KeqqJ9

Intology

3d

Can coding agents do research? We release NanoGPT-Bench, an internal eval we’ve used to test agents on an AI R&D problem with months of human progress. Codex, Claude Code, Autoresearch recover only 9.3% of human progress, mostly tuning hyperparams & ignoring algorithmic research.

NanoGPT-Bench is built on the NanoGPT Speedrun, a popular LLM pretraining competition to minimize the training time of a GPT-2 style model. Existing human submissions constitute nearly 2 years of work. To control for dependencies and contamination in frontier models, we standardize evaluation to a 5-month window of world records. Evaluation is fully autonomous and end-to-end, with no human intervention or internet access.

We found that:

1. Coding agents mostly spend compute on hyperparameter tuning, rarely attempting the algorithmic research that makes human records successful. In one instance, Codex spent 121 H100 hours adjusting two values in the training code: cooldown fraction and window size schedule parameters.
2. When coding agents consider algorithmic work, they rarely succeed. Instead, they either reason themselves away or regress performance. For example, Autoresearch repeatedly considered reducing the number of value embeddings from 3 to 2, but avoided the change after deeming it risky without any experimentation.

We thank Larry Dial for helpful discussions, Keller Jordan for the original NanoGPT Speedrun, as well as all human contributors for their efforts in producing world records!

Blog: https://lnkd.in/gxAwwn-c
GitHub: https://lnkd.in/gy-X8vxH

4 Comments

3d
Elvis S. shared this
// Code as Agent Harness //

100+ page report on all things related to agent harnesses. (bookmark it)

In particular, the survey summarizes methods and applications of code as agent harness. This paper makes a strong case that code-as-harness might be the key to moving us towards a broader science of harness engineering.

Is code all you need? Maybe. Regardless, the paper argues that future systems must have the following four properties: executable, inspectable, stateful, and governed.

12 Comments

4d
Elvis S. shared this
NEW paper from Meta. (bookmark it)

It's an agent system that autonomously discovers neural architectures that beat Llama 3.2 at 350M, 1B, and 3B scales, all under a 24-hour compute budget. They get this to work by splitting the search into two agents:

> AIRA-Compose searches the macro architecture.
> AIRA-Design implements the low-level mechanisms.

For devs: If one agent in your stack is doing both strategy and implementation, split it. Run a planner that picks the structure and an implementer that fills in the mechanisms. AIRA shows this beats a single end-to-end agent on a real, non-toy search problem. The same split is useful for pipeline assembly, query planning, prompt scaffolding, and tool-use programs.

7 Comments

6d
Elvis S. shared this
Interesting interpretability paper on tool-using agents. The authors probe hidden states and find the model often recognizes it should call a tool, but fails to actually call one. The mismatch ranges from 26% to 54%, and it concentrates entirely in the cognition-to-action transition, not in cognition itself.

In other words, the model usually knows it should call the tool. The internal probe direction is decodable. But the late-layer last-token regime rotates that signal nearly orthogonal to the action it produces.

This work tries to predict which interventions will actually work and which will not. Most will blame bad prompting or weak tool-call training, and probably ignore the late-layer geometry. If you have been A/B testing tool-use prompts and getting weird ceilings, this work might offer a good explanation for that behavior.

6 Comments

Elvis S. liked this

DAIR.AI

2d

If you design production agent systems, this matters. Most devs accidentally let their framework defaults make critical architecture decisions without thinking them through. This paper shows you how to choose deliberately instead. Why does it matter? You need to start making more deliberate decisions about your architecture.

2 Comments

Elvis S. liked this

n8n

1d

The AI Confidence Crisis: Why 92% of C-suite Leaders are Confident in AI ROI, but 75% Lack the Governance to Prove It.

A dangerous AI maturity gap is defining the enterprise landscape in 2026. While 92% of C-suite leaders express full confidence in AI ROI, 75% lack the governance frameworks, and 58% cite unclear ownership as barriers to measuring performance. The technology is rarely the bottleneck. Success hinges on governance, ownership, infrastructure, and a willingness to redesign work.

We introduce a practical 5-Level AI Maturity Framework to help leadership assess where their organization stands and understand the specific shifts required to advance. Read the first installment of our series to learn:

➡️ The critical distinction between Generative and Agentic AI
➡️ The risks of the Shadow AI vulnerability
➡️ The complete 5-level framework for self-assessment

🔗 Find out how to move your enterprise from pilot purgatory to systemic AI value: https://bit.ly/3Rip9tf

8 Comments

Experience & Education

DAIR.AI

Licenses & Certifications

AI Accountability Essential Training
LinkedIn
Issued Oct 2020

Learning Cloud Computing: Cloud Storage
LinkedIn
Issued Oct 2020

Elements of AI
University of Helsinki
Issued Jan 2020

Sequence Models
Coursera
Issued Feb 2019
Credential ID LHMGYVT8UQDR

Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization
Coursera
Issued Jan 2019
Credential ID BLP3FHDE59BQ

Structuring Machine Learning Projects
Coursera
Issued Jan 2019
Credential ID V3HQ3HNFTZJ6

Neural Networks and Deep Learning
Coursera
Issued Oct 2017
Credential ID ZRU889QBQFAS

Coursera Mentor Community and Training Course
Coursera
Issued Jul 2016
Credential ID V94MAB5B4YZ9

Machine Learning Foundations: A Case Study Approach
Coursera
Issued Apr 2016
Credential ID 68L5LXXYBKV6

Elastic Certified Engineer
Elastic
Issued Nov 2019 Expires Nov 2021

Publications

Galactica: A Large Language Model for Science

arXiv November 16, 2022

Information overload is a major obstacle to scientific progress. The explosive growth in scientific literature and data has made it ever harder to discover useful insights in a large mass of information. Today scientific knowledge is accessed through search engines, but they are unable to organize scientific knowledge alone. In this paper we introduce Galactica: a large language model that can store, combine and reason about scientific knowledge. We train on a large scientific corpus of papers, reference material, knowledge bases and many other sources. We outperform existing models on a range of scientific tasks. On technical knowledge probes such as LaTeX equations, Galactica outperforms the latest GPT-3 by 68.2% versus 49.0%. Galactica also performs well on reasoning, outperforming Chinchilla on mathematical MMLU by 41.3% to 35.7%, and PaLM 540B on MATH with a score of 20.4% versus 8.8%. It also sets a new state-of-the-art on downstream tasks such as PubMedQA and MedMCQA dev of 77.6% and 52.9%. And despite not being trained on a general corpus, Galactica outperforms BLOOM and OPT-175B on BIG-bench. We believe these results demonstrate the potential for language models as a new interface for science. We open source the model for the benefit of the scientific community.

CARER: Contextualized Affect Representations for Emotion Recognition

Empirical Methods in Natural Language Processing (EMNLP) 2018

Emotions are expressed in nuanced ways, which vary by collective or individual experiences, knowledge, and beliefs. Therefore, to understand emotion, as conveyed through text, a robust mechanism capable of capturing and modeling different linguistic nuances and phenomena is needed. We propose a semi-supervised, graph-based algorithm to produce rich structural descriptors which serve as the building blocks for constructing contextualized affect representations from text. The pattern-based representations are further enriched with word embeddings and evaluated through several emotion recognition tasks. Our experimental results demonstrate that the proposed method outperforms state-of-the-art techniques on emotion recognition tasks.

A Dynamic Influence Keyword Model for Identifying Implicit User Interests on Social Networks

ASONAM - IEEE 2017

The rapid growth of social networks has enabled users to instantly share what is happening around them. With the character-limitation and other feature constraints imposed by microblogs, users are obliged to express their intentions in implicit forms. This behavior poses many challenges for contextual approaches that aim to identify user intentions. Furthermore, users have the tendency to display different degrees of preference towards specific interests, simultaneously in time, making it difficult for models to rank the discovered interests. We propose a dynamic interest keyword model, a graph-based ranking mechanism, that identifies the different degrees of interests of a user. Our results show that the proposed system detects human-inferred interests 94% of the time, showing that the model is feasible and contributes various insights that can be used to improve user intention identification systems.

Clustering Social News Based on User Affection

Conference on Technologies and Applications of Artificial Intelligence (TAAI) - IEEE 2017

Recently, several news aggregation services have emerged to deal with the problem of information overload and news personalization. These news providers are able to organize news based on content similarity as a strategy to improve the user reading experience. However, organizing news solely on content fails to consider actual human reading behavior, and in turn ignores the importance of user perception in news personalization. We propose an enhanced news clustering technique based on a user affect model, which is a feasible framework for news categorization that can contribute to building more human-centric interactive systems. Empirical results demonstrate the effectiveness of clustering news articles through the enrichment of a user affect model when compared to traditional keyword-based clustering.

MIDAS: Mental illness detection and analysis via social media

ASONAM - IEEE 2016

Mental illnesses rank as some of the most disabling conditions, affecting millions of people across the globe. In general, the main challenge of mental disorders is that they remain difficult to detect in suffering patients. In an online environment, the challenge extends to the collection of patient data and the implementation of proper algorithms to assist in the detection of such illnesses. In this paper, we propose a novel data collection mechanism and build predictive models that leverage language and behavioral patterns, used particularly on Twitter, to determine whether a user is suffering from a mental disorder. After training the predictive models, they are further pre-trained to serve as the backend for our demonstration, MIDAS. MIDAS offers an analytics web-service to explore several characteristics pertaining to users' linguistic and behavioral patterns on social media, with respect to mental illnesses.

Subconscious crowdsourcing: a feasible data collection mechanism for mental disorder detection on social media

ASONAM - IEEE 2016

Mental disorders are currently affecting millions of people from different cultures, age groups and geographic regions. The challenge of mental disorders is that they are difficult to detect on suffering patients, thus presenting an alarming number of undetected cases and misdiagnosis. In this paper, we aim at building predictive models that leverage language and behavioral patterns, used particularly in social media, to determine whether a user is suffering from two cases of mental disorder. These predictive models are made possible by employing a novel data collection process, coined as Subconscious Crowdsourcing, which helps to collect a faster and more reliable dataset of patients. Our experiments suggest that extracting specific language patterns and social interaction features from reliable patient datasets can greatly contribute to further analysis and detection of mental disorders.

Unsupervised graph-based pattern extraction for multilingual emotion classification

Social Network Analysis and Mining - Springer 2016

The connected society we live in today has allowed online users to willingly share opinions on an unprecedented scale. Motivated by the advent of mass opinion sharing, it is then crucial to devise algorithms that efficiently identify the emotions expressed within the opinionated content. Traditional opinion-based classifiers require extracting high-dimensional feature representations, which become computationally expensive to process and can misrepresent or deteriorate the accuracy of a classifier. In this paper, we propose an unsupervised graph-based approach for extracting Twitter-specific emotion-bearing patterns to be used as features. By utilizing a more representative list of patterns as features, we improved the precision and recall of a given emotion classification task. Due to its novel bootstrapping process, the full system is also adaptable to different domains and languages. The experimental results demonstrate that the extracted patterns are effective in identifying emotions for English, Spanish, and French Twitter streams. We also provide detailed experiments and offer an extended version of our algorithm to support the classification of Indonesian microblog posts. Overall, our empirical results demonstrate that the proposed approach bears desirable characteristics such as accuracy, generality, adaptability, minimal supervision, and coverage.

Concept-based event identification from social streams using evolving social graph sequences

Social Network Analysis and Mining - Springer 2015

Social networks, which have become extremely popular in the twenty first century, contain a tremendous amount of user-generated content about real-world events. This user-generated content relays real-world events as they happen, and sometimes even ahead of the newswire. The goal of this work is to identify events from social streams. The proposed model utilizes sliding window-based statistical techniques to extract event candidates from social streams. Subsequently, the “Concept-based evolving graph sequences” approach is employed to verify information propagation trends of event candidates and to identify those events. The experimental results show the usefulness of our approach in identifying real-world events in social streams.

EmoViz: Mining the World's Interest through Emotion Analysis

ASONAM - IEEE 2015

Today, most personalized and recommendation services are built around interest extraction models but the outputs of these algorithms are ambiguous in nature. This makes it difficult to understand what users are personally interested in and more importantly what they are feeling towards these interests and how their interests transition through time. By studying both users' interests and emotions, simultaneously, one can further investigate the motivation behind these interests. Such findings can be useful to build better interest extraction models and algorithms that leverage personalized and recommendation services (e.g., ad targeting, e-commerce and dating sites). In this paper, we propose the demonstration of a web visualization tool - EmoViz - which facilitates the further exploration of users' interests and their emotions at a global scale. Such a tool, through the use of various visual components, aims to alleviate the problem of understanding what users of the world are interested in and the motivations behind their interests and feelings.

Projects

Prompt Engineering Guide

Mar 2023

https://www.promptingguide.ai/
ML Papers of the Week

Jan 2023

A newsletter to bring you the latest research developments in ML and LLMs.

https://www.linkedin.com/newsletters/top-ml-papers-of-the-week-7020865424875474944/
Modern Deep Learning Techniques Applied to Natural Language Processing

Nov 2018 - Present

This project contains an overview of recent trends in deep learning-based natural language processing (NLP). It covers the theoretical descriptions and implementation details behind deep learning models, such as recurrent neural networks (RNNs), convolutional neural networks (CNNs), and reinforcement learning, used to solve various NLP tasks and applications. The overview also contains a summary of state-of-the-art results for NLP tasks such as machine translation, question answering, and dialogue systems.

DAIR.ai

Jan 2018 - Present

Democratizing Artificial Intelligence Research, Education, and Technologies


Honors & Awards

Phi Tau Phi Scholastic Honor

The Phi Tau Phi Scholastic Honor Society of the Republic of China

Jun 2019

Awarded for achieving academic excellence during doctoral studies. This includes recognition for several research publications and a perfect GPA.

Languages

Spanish

Native or bilingual proficiency
Chinese

Elementary proficiency
English

Native or bilingual proficiency
Creoles and pidgins, English-based

Native or bilingual proficiency


Other similar profiles

Mohammad Akbari · AI Markets Group · 3K followers · United Kingdom
Subhrajit Roy · Google · 4K followers · Mountain View, CA
Sagar Joglekar · Intercom · 1K followers · London Area, United Kingdom
Nikolay Savinov · Google DeepMind · 1K followers · London
Ricardo Pio Monti · Google DeepMind · 4K followers · San Francisco, CA
Christian Fügen · Meta · 1K followers · London Area, United Kingdom
Nantas Nardelli · 2K followers · London
Giorgio Roffo · Equixly API Security · 2K followers · Italy
Aja Huang · Google DeepMind · 2K followers · Greater London
Martin Szummer · Google DeepMind · 2K followers · Greater London
Yaroslav Ganin · OpenAI · 764 followers · United Kingdom
Mehdi Mirza · DeepMind · 1K followers · United Kingdom
Viorica Patraucean · Google DeepMind · 2K followers · London
Francesco Visin · Google DeepMind · 4K followers · United Kingdom
Diane Bouchacourt · Doctolib · 1K followers · France
Fabio Petroni · EMBL · 4K followers · Rome
Dr. Cédric Mesnage · University of Exeter · 1K followers · Exeter
Sander Dieleman · Google DeepMind · 5K followers · United Kingdom
Pablo Sprechmann · DeepMind · 4K followers · London
Jovana Mitrovic · DeepMind · 1K followers · London




