James Horey

Knoxville, Tennessee, United States
7K followers 500+ connections

About

CEO of Slideform - a platform to help agencies & client-facing teams automate their…

Experience

  • Slideform Inc.

    Knoxville, Tennessee, United States

  • -

    Knoxville, Tennessee, United States

  • -

    Knoxville, Tennessee

  • -

    www.cirrusinsight.com

  • -

    knoxville area, tennessee

Education

  • The University of New Mexico

    -

    Dissertation: A Hierarchical Group Model for Programming Sensor Networks
    Graduation with Distinction

Publications

  • Performance implications from sizing a VM on multi-core systems: A Data analytic application's view

    High-Performance Grid and Cloud Computing Workshop (with IPDPS'13)

    Virtual machines on multi-core environments form the core of cloud computing. In addition, data analytics applications, such as Cassandra and Hadoop, are becoming increasingly popular on cloud computing platforms. This convergence necessitates a better understanding of the performance and cost implications of such hybrid systems. For example, the very first step in hosting applications in virtualized environments, requires the user to configure the number of virtual processors and the size of memory. In this paper, we present a quantitative performance analysis of data analytics applications running on multi-core virtual machines.

  • Big Data Platforms as a Service: Challenges and Approach

    Usenix Workshop on Hot Topics in Cloud Computing

  • Design Principles for Effective Knowledge Discovery from Big Data

    IEEE Conference on Software Architecture

  • Enhancing Privacy in Participatory Sensing Applications with Multidimensional Data

    IEEE International Conference on Pervasive Computing and Communications

  • Reconstructing Spatial Distributions from Anonymized Locations

    Workshop on Secure Data Management on Smartphones and Mobiles

Projects

  • Ferry

    Ferry is an open-source big data development environment. Ferry lets you define, run, and deploy big data stacks on your local machine using Docker. It currently supports Hadoop, Cassandra, and GlusterFS/OpenMPI.

  • Paja

    Paja is an implementation of Paxos for Java. The basic idea is that multiple "replicas" must all agree on a single order of events, even if each replica receives individual events in a different order (due to concurrent client submissions). In addition, Paxos is designed to ensure that all replicas agree on a global order even if a certain number of nodes randomly fail and restart.

  • KevaDB

    - Present

    Keva is an open-source key-value store that features a simple API to store and retrieve arbitrary key-value pairs and safely persists all data to disk. It is intended to be used as a library within the context of another application. Special features worth noting include:

    (1) Keva is implemented in Java and heavily utilizes the concurrent and nio libraries
    (2) Keva supports the idea of different "branches" associated with a particular key
    (3) All values within a branch are appended into a history (there are no in-place updates)

  • Spot Hadoop

    - Present

    Spot-Hadoop is a tool to dynamically configure, launch, and benchmark Hadoop on shared, parallel filesystems (namely Lustre). The primary motivation behind this project is to get a dynamic Hadoop tool (similar to Amazon's EMR service) running on Oak Ridge National Lab's infrastructure. Because ORNL primarily runs large-scale MPI clusters with a shared Lustre filesystem, the best way to accomplish this is unclear. This project explores both the software architecture and the performance issues involved in running Hadoop in this environment.

  • CMS Knowledge Discovery Infrastructure

    We helped the Centers for Medicare & Medicaid Services develop a "Knowledge Discovery Infrastructure" at ORNL to facilitate large-scale data analysis and visualization. We used the KDI to explore provider and beneficiary relationships, and also to help identify irregularities in claims processing.

    Links from the press:
    http://gcn.com/articles/2012/03/09/cms-analytics-project-health-care-big-data.aspx
    http://technews.tmcnet.com/knowledge-management/topics/knowledge-management/articles/303131-cms-unveils-knowledge-discovery-infrastructure-aims-patient-data.htm
    http://www.fiercegovernmentit.com/story/doe-labs-help-cms-manage-healthcare-data/2012-10-22

Languages

  • Korean

    -

Organizations

  • KnoxFounders.com

    Co-organizer

    - Present

    Community of Knoxville's Founders and Entrepreneurs

  • KnoxScience.com

    Co-organizer

    - Present

    Community of Knoxville's Data Scientists
