“Huajun is a very accomplished AI/ML leader with rich industrial experience, comprehensive knowledge, and strong hands-on capabilities. For couple of years, we closely collaborated in applying cutting-edge conversational AI technologies to innovate customers experience and agents experience in healthcare. He demonstrated great passion and execution, helped mentoring machine learning engineers in my team, made significant contribution to our products. I would partner with him again without any hesitation.”
About
Experienced engineering leader in AL/ML with strong strategic planning, team…
Activity
-
The team is at NAACL this week speaking to some of the smartest in AI computational linguistics and NLP! We're fast growing and want to work with the…
The team is at NAACL this week speaking to some of the smartest in AI computational linguistics and NLP! We're fast growing and want to work with the…
Liked by Huajun Zeng
-
Our paper about the CRAG (Comprehensive RAG) benchmark is listed in Hugging Face "Daily papers"! I will talk about the benchmark and share our…
Our paper about the CRAG (Comprehensive RAG) benchmark is listed in Hugging Face "Daily papers"! I will talk about the benchmark and share our…
Liked by Huajun Zeng
-
A late update, I graduated from Carnegie Mellon University with a masters degree in artificial intelligence (MSAII)! Thanks Dr. Michael Shamos for…
A late update, I graduated from Carnegie Mellon University with a masters degree in artificial intelligence (MSAII)! Thanks Dr. Michael Shamos for…
Liked by Huajun Zeng
Experience
Education
Publications
-
LCQMC: A Large-scale Chinese Question Matching Corpus
COLING 2018
-
An experimental study on large-scale web categorization
Special interest tracks and posters of the 14th international conference on World Wide Web
-
A practical system of keyphrase extraction for web pages
CIKM 2005
-
A unified framework for clustering heterogeneous web objects
WISE 2002
-
Applying associative relationship on the clickthrough data to improve web search
ECIR 2005
-
BlogViewer: A Topic-Airscape Explorer for Blogspace
-
-
Cbc: Clustering based text classification requiring minimal labeled data
ICDM 2003
-
Cubesvd: a novel approach to personalized web search
WWW 2005
-
CWS: a comparative web search system
WWW 2006
-
DCFLA: A distributed collaborative-filtering neighbor-locating algorithm
Inf. Sci. 177
-
Demographic prediction based on user's browsing behavior
WWW 2007
-
Enabling personalization services on the edge
ACM Multimedia 2002
-
Enhancing text clustering by leveraging Wikipedia semantics
SIGIR 2008
-
Exploiting pagerank at different block level
WISE 2004
-
Exploiting the hierarchical structure for link analysis
SIGIR 2005
-
Finding group shilling in recommendation system
WWW (Special interest tracks and posters) 2005
-
Finding keyword from online broadcasting content for targeted advertising
-
-
Homepage live: automatic block tracing for web personalization
WWW 2007
-
Implicit link analysis for small web search
SIGIR 2003
-
Importance-based web page classification using cost-sensitive SVM
WAIM 2005
-
Improving Query Translation for Cross-Language Information Retrieval using a Web-based Approach
-
-
Improving text classification by using encyclopedia knowledge
ICDM 2007
-
Irc: An iterative reinforcement categorization algorithm for interrelated web objects
ICDM 2004
-
KNM: a novel intelligent user interface for webpage navigation
AIRS 2005
-
Learning to cluster web search results
SIGIR 2004
-
Log Mining to Improve the Performance of Site Search.
WISE Workshops 2002
-
Mining clickthrough data for collaborative web search
WWW 2006
-
MRSSA: An iterative algorithm for similarity spreading over interrelated objects
CIKM 2004
-
Optimizing web search using spreading activation on the clickthrough data
WISE 2004
-
Optimizing web search using web click-through data
CIKM 2004
-
ReCoM: reinforcement clustering of multi-type interrelated data objects
SIGIR 2003
-
Reinforcing web-object categorization through interrelationships
Data Min. Knowl. Discov. 12
-
Scalable collaborative filtering using cluster-based smoothing
SIGIR 2005
-
Similarity spreading: a unified framework for similarity calculation of interrelated objects
WWW (Alternate Track Papers & Posters) 2004
-
Subjectivity Categorization of Weblog with Part-of-Speech Based Smoothing
ICDM 2006
-
Supervised latent semantic indexing for document categorization
ICDM 2004
-
Support vector machines classification with a very large-scale taxonomy
SIGKDD Explorations 7
-
User Access Pattern Enhanced Small Web Search.
WWW (Posters) 2003
-
Using probabilistic latent semantic analysis for personalized web search
APWeb 2005
-
Using Wikipedia knowledge to improve text classification
Knowl. Inf. Syst. 19
-
Web page clustering enhanced by summarization
CIKM 2004
-
Web-page classification through summarization
SIGIR 2004
-
Web-page summarization using clickthrough data
SIGIR 2005
Patents
-
Advertising keyword cross-selling
US 7788131
-
Advertising service based on content and user log mining
US 8122049
-
Augmenting user, query, and document triplets using singular value decomposition
US 7747618
-
Block tracking mechanism for web personalization
US 7818330
-
Click-through log mining
US 8321448
-
Cluster-based scalable collaborative filtering
US 8738467
-
Clustering based text classification
US 7366705
-
Comparative web search
US 7571162
-
Comparatively crawling web page data records relative to a template
US 7555480
-
Content propagation for enhanced document retrieval
US 7305389
-
Creating home pages based on user-selected information of web pages
US 7594013
-
Distributed hierarchical text classification framework
US 7809723
-
Diverse topic phrase extraction
US 8280877
-
Document characterization using a tensor space model
US 7529719
-
Efficient retrieval algorithm by query term discrimination
US 7822752
-
Efficient retrieval algorithm by query term discrimination
US 7925644
-
Enhanced document retrieval
US 7289985
-
Evaluating and generating summaries using normalized probabilities
US 7565372
-
Generation of contextual image-containing advertisements
US 8417568
-
Identification of topics for online discussions based on language patterns
US 7739261
-
Identifying influential persons in a social network
US 8359276
-
Implicit links search enhancement system and method for search engines using implicit links generated by mining user access patterns
US 7584181
-
Key phrase navigation map for document navigation
US 7861149
-
Keyword usage score based on frequency impulse and frequency weight
US 7644075
-
Location updates for a distributed data store
US 8484417
-
Location updates for a distributed data store
US 8108612
-
Method and system for adapting search results to personal information needs
US 7630976
-
Method and system for adapting search results to personal information needs
US 7849089
-
Method and system for calculating document importance using document classifications
US 7774340
-
Method and system for classifying and identifying messages as question or not a question within a discussion thread
US 7590603
-
Method and system for classifying display pages using summaries
US 7392474
-
Method and system for clustering using generalized sentence patterns
US 7584100
-
Method and system for detecting when an outgoing communication contains certain content
US 7594277
-
Method and system for detecting when an outgoing communication contains certain content
US 8782805
-
Method and system for determining similarity of items based on similarity objects and their features
US 7533094
-
Method and system for determining similarity of objects based on heterogeneous relationships
US 7376643
-
Method and system for incrementally learning an adaptive subspace by optimizing the maximum margin criterion
US 7502495
-
Method and system for mining information based on relationships
US 7529735
-
Method and system for mining information based on relationships
US 9195942
-
Method and system for prioritizing communications based on interpersonal relationships
US 7917587
-
Method and system for prioritizing communications based on sentence classifications
US 8112268
-
Method and system for prioritizing communications based on sentence classifications
US 7567895
-
Method and system for ranking documents of a search result to improve diversity and information richness
US 7664735
-
Method and system for ranking messages of discussion threads
US 7437382
-
Method and system for ranking objects based on intra-type and inter-type relationships
US 7346621
-
Method and system for summarizing a document
US 7698339
-
Object clustering using inter-layer links
US 7194466
-
Peer-to-peer advertisement platform
US 8548853
-
Person disambiguation using name entity extraction-based clustering
US 7685201
-
Query-based snippet clustering for search result grouping
US 7617176
-
Reinforced clustering of multi-type data objects for search term suggestion
US 7689585
-
Scalable probabilistic latent semantic analysis
US 7844449
-
Search engine enhancement using mined implicit links
US 8312035
-
System and method for utilizing the content of an online conversation to select advertising content and/or other relevant information for display
US 7653627
-
Term suggestion for multi-sense query
US 7428529
-
Text classification by weighted proximal support vector machine based on positive and negative sample sizes and weights
US 7707129
-
User query mining for advertising matching
US 8285745
-
Verifying relevance between keywords and web site contents
US 7260568
-
Web page ranking with hierarchical considerations
US 7779001
Projects
-
AllenNLP
- Present
An Apache 2.0 NLP research library, built on PyTorch, for developing state-of-the-art deep learning models on a wide variety of linguistic tasks.
-
Service Fabric
- Present
Service Fabric is a distributed systems platform for packaging, deploying, and managing stateless and stateful distributed applications and containers at large scale. Service Fabric runs on Windows and Linux, on any cloud, any datacenter, across geographic regions, or on your laptop.
Recommendations received
2 people have recommended Huajun
Join now to viewMore activity by Huajun
-
Thanks to CNET for including Otter.ai in the Editors' Choice Awards for Best of AI! The CNET product review team awarded an Editors' Choice Award…
Thanks to CNET for including Otter.ai in the Editors' Choice Awards for Best of AI! The CNET product review team awarded an Editors' Choice Award…
Liked by Huajun Zeng
-
GenAI Summit brought together thousands of professionals, researchers, students, and business leaders, to discuss the latest in GenAI. That's a lot…
GenAI Summit brought together thousands of professionals, researchers, students, and business leaders, to discuss the latest in GenAI. That's a lot…
Liked by Huajun Zeng
-
Today is my first day at LinkedIn. I'm very happy to join the AI growth team. I love the culture and what LinkedIn represents: connecting the world's…
Today is my first day at LinkedIn. I'm very happy to join the AI growth team. I love the culture and what LinkedIn represents: connecting the world's…
Liked by Huajun Zeng
-
🔥 Meet Dr. Sam Liang, CEO & Founder of Otter.ai, speaking at #GENAISummitSF2024 on "AI for Work: How GenAI Improves the Way We Work and Collaborate"…
🔥 Meet Dr. Sam Liang, CEO & Founder of Otter.ai, speaking at #GENAISummitSF2024 on "AI for Work: How GenAI Improves the Way We Work and Collaborate"…
Liked by Huajun Zeng
-
OpenAI is expected to demo a real-time voice assistant tomorrow. What does it take to deliver an immersive, or even magical experience? Almost all…
OpenAI is expected to demo a real-time voice assistant tomorrow. What does it take to deliver an immersive, or even magical experience? Almost all…
Liked by Huajun Zeng
-
Some have argued that frontier AI models should not be open sourced because it would enable geopolitical adversaries, such as China, to get their…
Some have argued that frontier AI models should not be open sourced because it would enable geopolitical adversaries, such as China, to get their…
Liked by Huajun Zeng
-
We've come a long way in a year and we're not slowing down! Here's a look back at the Otter team less than a year ago and a glimpse of this week's…
We've come a long way in a year and we're not slowing down! Here's a look back at the Otter team less than a year ago and a glimpse of this week's…
Liked by Huajun Zeng
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Huajun Zeng
-
HUAJUN ZENG
Student at Fudan University
-
Huajun Zeng
-
Huajun Zeng
--
-
曾华军
AUT University - System Network Support Consultant
54 others named Huajun Zeng are on LinkedIn
See others named Huajun Zeng