Bingyan Liu

Bingyan Liu

Mountain View, California, United States
570 followers 500+ connections

About

Team Leader, I build or rebuild teams into success and bring the best out of people…

Activity

Join now to see all activity

Experience

  • Kargo Graphic

    Kargo

    San Francisco Bay Area

  • -

    San Francisco Bay Area

  • -

    San Francisco Bay Area

  • -

    Menlo Park, California

  • -

    Mountain View, California

  • -

    Beijing City, China

  • -

    Beijing City, China

Education

Publications

  • Distributional Similarity on Large Chinese Web Collection

    CCIR 2013

    Distributional word similarity on web scale Chinese web collection is an important and widespread
    used preprocess phase in many NLP tasks. On English web collections vector-based models are widely used to measure word similarity out of context. This paper proposed an unsupervised Chinese word similarity measuring system based on context vector. We used the system measuring Chinese word similarity on web scale dataset and verified the Harris’ distribution hypothesis on Chinese corpus. The…

    Distributional word similarity on web scale Chinese web collection is an important and widespread
    used preprocess phase in many NLP tasks. On English web collections vector-based models are widely used to measure word similarity out of context. This paper proposed an unsupervised Chinese word similarity measuring system based on context vector. We used the system measuring Chinese word similarity on web scale dataset and verified the Harris’ distribution hypothesis on Chinese corpus. The system shows high performance and well effect. In experiment this paper compared impact of each model’s parameters on final results.

    Other authors
  • Improving Performance of Chinese Word Segmentation on Large Dataset

    CCIR 2013

    Chinese word segmentation is a fundamental problem in the area of natural language processing
    problem. With the growth of Internet, Chinese word segmentation faces greater challenge of bigger data scale. The design of the word segmentation system requires trade-offs between word segmentation speed and precision rate. Current research mostly concentrates on improving precision rate using complex word segmentation model. This paper proposes a new Chinese word segmentation system to improve the…

    Chinese word segmentation is a fundamental problem in the area of natural language processing
    problem. With the growth of Internet, Chinese word segmentation faces greater challenge of bigger data scale. The design of the word segmentation system requires trade-offs between word segmentation speed and precision rate. Current research mostly concentrates on improving precision rate using complex word segmentation model. This paper proposes a new Chinese word segmentation system to improve the system’s data through-put and speed. The system is capable of processing of web scale Chinese webpage dataset quickly. Experiment shows current system exploits cluster system performance very well and yields a performance speed-up to 136 times compared to current single-thread baseline which is capable of processing web scale data efficiently.

    Other authors

Courses

  • Quantum field theory

    -

Languages

  • English

    -

  • Chinese (Simplified)

    -

More activity by Bingyan

View Bingyan’s full profile

  • See who you know in common
  • Get introduced
  • Contact Bingyan directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Bingyan Liu in United States

Add new skills with these courses