Skip to content

Instantly share code, notes, and snippets.

@ryanswanstrom
Created March 29, 2012 02:57
Show Gist options
  • Save ryanswanstrom/2232803 to your computer and use it in GitHub Desktop.
Save ryanswanstrom/2232803 to your computer and use it in GitHub Desktop.
A collection of 16 data scientist job posts
Job Category: IT
Location: United States, WA, Redmond
Job ID: 786045
Product: (Not Product Specific)
Division: IT
The Data & Decision sciences Team within technology Office in MSIT, helps drive actionable business intelligence through advanced statistical modeling and business analytics, throughout Microsoft. The team focuses on enterprise level engagement, where advanced data mining and modeling skills are needed to find business insights. We have a strong team of experienced statistician with an average experience of 15+ years in various industries. Currently we are expanding our work in areas of simulation, system modeling and text mining, to support our internal clients. Our goal is to support business performance enabled through data analysis, statistical modeling - resulting into business impact. Candidate will be expected to work with internal clients on analytic projects and identify potential improvement opportunities. The candidate must possess a passion for advanced analytics and skills in areas of data mining.
Core Job Responsibilities:
The right candidate will have a combination of advanced statistical analysis skills and an interest in driving business impact through actionable analytic insight. He/she should be able to independently run analytic projects and help our internal clients use the findings to drive their business.
Conduct and manage analysis and modeling - in the areas of text mining, sentiment analysis, forecasting and simulation. Perform exploratory data analysis; build behavioral models for uses and partners. Utilize data mining technologies and use various data sources, some of which may include Sales and other financial data, 3rd party demographic, Partner data, to gain insight into KPIs. Provide complete solutions to business problems using data analysis, data mining techniques, and statistics. Work on projects may include case studies, methodology research, tool evaluations, model development/refinement and deployment. Serve as subject matter expert on analytics for Microsoft IT.
• Preferred experience solving business problems using modeling, text/data mining on large scale real-world business data sets
• Experience with various data analysis tools, data mining tools, and statistical packages
• Familiarity with applying data mining techniques (i.e. classification, & clustering, sentiment analysis through text mining, time series forecasting, Simulation and system modeling), statistics and information retrieval methods to real-world data
Familiarity of system modeling and simulation is desirable but not necessary
• Ability to work independently or to manage a virtual team that will research innovative solutions to challenging business problems
• Ability to collaborate with partners and drive analytic projects end to end,
• Superior communication skills, both verbal and written
• Attention to detail and data accuracy
• Preferred fluency in the following: SQL, SAS, Minitab
o 80% of the time driving multiple analytic projects with high complexity, strategic value, and executive visibility
o 20% of the time sharing best practices and growing the culture of data driven decision making in Microsoft
Minimum 5+ years of related experience
Master’s degree required, preferably in statistics, Mathematics, CS or Economics
o Consulting back ground preferred
--
Netflix is seeking an outgoing, curious, interdisciplinary data expert to work as a statistician, data miner, and business analyst. In this role you will have the opportunity to work across many areas of Netflix to analyze financial, operational, web, and consumer behavior. You will work with a variety of people and technologies to bring about advanced insights into our business.
Your work will help management be better informed to make the right decisions for our customers, our suppliers, and to improve our overall efficiency and profitability. At Netflix, our culture and management is committed to being a fact-based and analysis driven organization; because of this, your contributions will be of great interest to the various business areas. To succeed in this role you should be passionate about delivering accurate and actionable analyses to business decision-makers and be able to understand and creatively solve problems that span statistics, technology, and business. You should have exceptional communication skills and be adept at translating business questions into meaningful analysis plans.
One principal area of responsibility will be to help develop new and to tune existing statistical models used for demand prediction for our extensive library of streaming content. We’re proud of our expertise we’re developing in understanding demand and ultimately translating this into consumer value – so come be a part of the story!
Qualifications:
• MS/PhD. degree in Statistics, Mathematics, Operations Research, CS, Econometrics or equivalent/related degree.
• 4+ years relevant experience with a proven track record of leveraging data analysis to drive significant business impact.
• Expertise in predictive modeling. Must have knowledge/experience in some/all of the following: Multivariate Regression, Logistic Regression, Support Vector Machines, Bagging, Boosting, Decision Trees, and Time Series Analysis.
• Experience in Optimization, Stochastic Processes, Experiment Analysis, and/or Bootstrapping a plus.
• Proficiency in at least one statistical analysis tools such as R, SAS, and/or Weka.
• Above average capabilities with basic analysis tools of SQL and Excel.
• Proven interpersonal, communication and presentation skills – must be able to explain technical concepts and analysis implications clearly to a wide audience and be able to translate business objectives into actionable analyses.
• Java/C++/ Python/Perl programming skills, while not required, are considered a plus.
--
As a Kaggle data scientist, you will be crucial to the company’s mission. You will work with customers to prepare their competitions, from problem identification to data cleaning and exploration, and build tools to automate this process. At the end of the competition you'll verify the winning models, and integrate those models into our customers’ operations. Your role will go to the heart of Kaggle’s value proposition to clients.
As a Kaggle data scientist, you will have access to a huge range of data sets and customers. Over the last year, we have worked on data sets relating to:
galaxy shape measurement to help NASA and the ESA map dark matter;
the reliability of used cars, to work out which are likely to be lemons;
genetic markers, to predict viral load in HIV; and
grocery store visits, to help dunhumby predict the timing of future visits and the amount likely to be spent.
You will also get to see first-hand which techniques work on which data sets, and to hang out with some of the world’s greatest data scientists.
On a typical day, a Kaggle data scientist will:
work with a customer to structure their competition
roller blade down the Embarcadero
read and discuss thrillers like Chris Bishops’ Pattern Recognition and Machine Learning
create a benchmark in R / Matlab / Python
write an evaluation metric in C#
build data anonymization tools
correspond with the Kaggle community on our forums
explore the winning solution to one of our competitions
--
Data Scientist
See who you know at Greenplum
San Mateo, CA, United States
Job Type: Full-Time
If you are looking for a career where your services will be in high demand, you should find something where you provide a scarce, complementary service to something that is getting ubiquitous and cheap. So what’s getting ubiquitous and cheap? Data. And what is complementary to data? Analysis.
– Prof. Hal Varian, UC Berkeley, Chief Economist at Google.
Building the world's largest multi-technology distributed computing platforms, leveraging massively parallel processing and Hadoop, Greenplum is committed to pioneering data-intensive analytic processing. We are looking for talented individuals to join our growing analytics solutions team, which is developing a fundamentally new approach to how business generates value from data. This team will develop new methodologies and tools to enable advanced modeling and statistical analysis against petabyte-scale data sets.
Our Data Scientists create new sources of data-driven innovation and value for our customers and prospects while proving the analytical capabilities of Greenplum’s technology stack. They help customers respond to their top-priority business goals by introducing new ways to tap their exploding data assets, using statistical methods, machine learning, and predictive models. Greenplum Data Scientists develop new practices and methodologies for working with the Greenplum data warehouse technology, often working closely with leading academics and industry experts. They also work with industry-leading engineers to create new tools and features that support sophisticated analytics within the database.
Greenplum is aggressively building out its Data Scientist team and is looking for the best and brightest subject-matter experts across verticals and analytical techniques. If you bring a deep focus and passion for a particular specialization within analytics (e.g., text or sentiment analytics, network/graph analytics, time series modeling, etc.), or a key vertical market (i.e., life sciences, digital media, energy, etc.) we would love to talk to you!
Responsibilities:
Use techniques from supervised and unsupervised machine learning, statistical analysis and predictive modeling to deliver business insights to prospects and customers based on data stored in Greenplum data warehouses.
Work directly with customers to educate them on “moving beyond BI” and training their internal resources to execute advanced forms of analytics on Greenplum.
Create re-usable implementations of statistical tests and models using the available technologies in the Greenplum database.
Work with the academic and business community to develop new techniques and to contribute to research in the area of advanced analytics on large databases.
Generate new product requirements for the Greenplum engineering group to enhance the analytics capabilities of the database.
Assist in customer engagement management, requirements definition, project scoping, timeline management, and results documentation to ensure professional relationship management.
Requirements:
A proven passion for generating insights from data, with a strong familiarity with the higher-level trends in data growth, open-source platforms, and public data sets.
Strong knowledge of statistical methods generally, and particularly in the areas of modeling and business analytics
Experience with statistical languages and packages, including R, S-Plus, SAS and Matlab, and/or Mahout
Experience working with relational databases and/or distributed computing platforms, and their query interfaces, such as SQL, MapReduce, PIG, and Hive.
Preferably experience with additional programming languages, including Python, Java, and C/C++.
Familiarity with visualization software and techniques (including Tableau), and business intelligence (BI) software, such as Microstrategy, Cognos, Pentaho, etc.
Preferably, experience working hands-on with large-scale data sets
Advanced degree (PhD or Masters) in an analytical or technical field (e.g. applied mathematics, statistics, physics, computer science, operations research)
A strong business-orientation, able to select the appropriate complex quantitative methodologies in response to specific business goals
A team player, capable of conducting independent research, who is excited by and motivated by hard technical challenges
Results-driven, self-motivated, self-starter
Excellent written and verbal communications skills, with a proven ability to translate complex methodologies and analytical results to higher-level business insights and key take-aways
Ability to travel as-needed to meet with customers
Greenplum is setting the pace in the Big Data Analytics space. We are growing rapidly and providing solutions to major companies in the industry. We offer individuals a generous compensation/benefits package as well as equity in our company.
Additional Relevant Skills (college hires):
Algorithm benchmarking across tool kits
Ability to develop tools within MadLib
QA data set within MadLib
QA data set, run algorithms
SQL, C++, Java
Quantative - strong background
Hadoop
Text Analytics
--
Data Scientist
Purpose & Background of Role
We're seeking a top notch data scientist with strong programming skills to join the small and very enthusiastic Music Information Retrieval team at Last.fm. This is a full-time position, based in London.
Are you an experienced scientist as well as a hands-on implementer who is able to work around constraints in disk speed, memory size and CPU cycles? Do you want to help design, implement and evaluate playlisting and recommendation algorithms that reach millions of music lovers each month? Do you take pride in being clever and solving difficult problems creatively? Are you full of ideas and always looking for new ways of making use out of data? Are you an advocate for data-driven development and fully capable of conducting a proper A/B test?
The MIR team works with large volumes of audio (millions of tracks), user data (tens of billions of scrobbles), and metadata (7,593 different spellings of Britney Spears). We run data processing and analysis jobs on our own Hadoop Cluster, and write and maintain scalable realtime services including several recommendation and playlisting engines. We encourage publication of research results and contribution to open source projects.
We're looking for a talented and enthusiastic scientist to help us explore and learn from our rich datasets, build services and infrastructure, conduct data-driven evaluations and experiments, improve our recommendation and playlisting services, and help drive innovation in our website, API, client and mobile applications.
Requirements:
•Highly fluent in Python and either C++ or Java (or both)
•Passion for machine learning and data mining
•Proficient with databases, both relational and non-relational
•Experience with Hadoop and analysing terabyte-scale datasets
•Familiar with data-driven development and split testing
•Basic understanding of common web technologies
• Comfortable with the Unix CLI and shell scripting
•MS or PhD in computer science or equivalent
• Track record in music information retrieval or recommender systems research is a plus
--
Job Description:
The Data Scientist will lead and participate in projects that require advanced data and business analysis skills including best in practice statistical methodologies and modeling and data mining.
Responsibilities:
- Work with Rackers to help identify opportunities using advanced analytics and work with the necessary teams (business, Enterprise Data Warehouse and Software Engineers) to implement the full analytical solution
- Perform data mining and exploratory analysis to identify opportunities
- Present analysis to business sponsors via data visualizations, white papers, power point presentations, etc
- Develop predictive models for operational use
- Select tool generated algorithms for expeditious solutions or, when necessary, develop custom algorithms via necessary means.
Job Requirements
Job Requirements
The ideal candidate will have the following:
- Strong working knowledge of data mining algorithms including decision trees, probability networks, association rules, clustering, regression, and neural networks
- Demonstrated working knowledge of statistical tools such as SAS, SPSS, ‘R’, etc. Strong SQL skills
- Expert knowledge and ability for mining structured and unstructured data
- Fundamental understanding of relational data modeling and experience with relational database platforms such as (but not limited to) Oracle, SQL Server, Postgres and etc
- Strong programming skills in Object Oriented language such as C++, Java or similar language
- Strong relationship management skill
- Excellent leadership, time management, communications, decision-making, presentation, human relations and organization skills
- Able to resolve problems in an effective and timely manner
- Ability to work independently and in a team environment
- Master’s degree in Machine Learning, Applied Statistics, Data Mining, or related field and five years of experience applying data mining techniques in a professional environment required
- Experience with leveraging technologies to allow for efficient data processing via distributed computing and data storage
- Prefer Ph.D. in Machine Learning, Applied Statistics, Data Mining, or relevant field
- Experience with popular open source distributed database platforms such as Hadoop and Cassandra a plus
- Experience with semantic web and social network analytics a plus
Rackspace, a world leader in hosting delivers enterprise-level managed hosting, cloud hosting, and e-mail hosting services to businesses of all types and sizes globally. Today we serve thousands of customers from data centers around the world. Rackspace integrates the industry's best technologies and practices for each customer's specific needs delivering it as a service via the company's commitment to Fanatical Support®. We serve as an extension of our customers' IT departments enabling them to focus on their core business. Rackspace was founded in 1998 and since then has had significant growth year after year. There are now over 3,000 Rackers employed worldwide dedicated to providing customers our premier hosting services.
Rackspace Hosting is an Equal Opportunity Employer.
The above information has been designed to indicate the general nature and level of work performed by employees in this classification. It is not designed to contain or to be interpreted as a comprehensive inventory of all duties, responsibilities, and qualifications required of the employee assigned to this job.
--
Job Description
Amazon Kindle offers the opportunity to work with vast stores of exceptionally rich data and implement business-impacting predictive models and decision support systems very rapidly. This position is in the world-wide demand planning team for Amazon's Kindle division including Kindle Fire and Kindle e-readers and the right candidate will have the opportunity to build our decision support systems.
The ideal candidate will be knowledgeable about best practice data modeling and analysis techniques and he or she will be adept at building systems that create value out of data analysis. He or she will:
Implement systems and work with developers and data engineers to operationalize our models for use by the larger organization
Improve current statistical models (e.g. ARIMA, VAR, regression) and models of customer behavior (e.g. product diffusion models) by incorporating richer information and adding new modeling techniques or methodologies
Understand and predict customer behavior in the aggregate and by segmentations
Help the business to measure and manage risk associated with building devices
Interact with other teams within Amazon to ensure that we are capturing the best practices around the modeling of customer behavior and statistical modeling
Think creatively: Amazon is a place that is permanently pushing the boundaries of the industry. We are not looking for someone that will just execute database queries and implement a list of functional requirements. We want someone that will take it to the next level in terms of how to best use amazon’s information and amazon’s technological infrastructure
Generate Impact: We want someone obsessed with impact. We don’t want someone that is just interested in finding patterns in the data. We want someone that understands how to turn those discoveries into value for our customers every day.
Basic Qualifications
3+ years of experience with data analysis in a variety of quantitative fields.
Undergraduate degree in computer science or some experience in software development engineering
Graduate degree in statistics, finance or quantitative social science or 5+ years of experience building data analysis and decision support systems (OBIEE, etc)
Experience with SQL and with R, Stata, SAS or other statistical language
Strong written and verbal communication skills
Preferred Qualifications
Experience with data analysis in a variety of quantitative fields and experience building data analysis and decision support systems
3+ years of experience in software engineering or enterprise software implementation and configuration
PhD degree in a quantitative field
Passion about analyzing and creating value out of data
The candidate should be capable of, and motivated to collaboratively work with a diverse and innovative team of engineers, analysts and business management
Passion about getting the CE products customers want, to them, when they want them
--
Facebook is seeking a Data Scientist to join our Data Science team. Individuals in this role are expected to be comfortable working as a software engineer and a quantitative researcher. The ideal candidate will have a keen interest in the study of an online social network, and a passion for identifying and answering questions that help us build the best products.
Responsibilities
Work closely with a product engineering team to identify and answer important product questions
Answer product questions by using appropriate statistical techniques on available data
Communicate findings to product managers and engineers
Drive the collection of new data and the refinement of existing data sources
Analyze and interpret the results of product experiments
Develop best practices for instrumentation and experimentation and communicate those to product engineering teams
Requirements
M.S. or Ph.D. in a relevant technical field, or 4+ years experience in a relevant role
Extensive experience solving analytical problems using quantitative approaches
Comfort manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources
A strong passion for empirical research and for answering hard questions with data
A flexible analytic approach that allows for results at varying levels of precision
Ability to communicate complex quantitative analysis in a clear, precise, and actionable manner
Fluency with at least one scripting language such as Python or PHP
Familiarity with relational databases and SQL
Expert knowledge of an analysis tool such as R, Matlab, or SAS
Experience working with large data sets, experience working with distributed computing tools a plus (Map/Reduce, Hadoop, Hive, etc.)
--
Data Scientist
Full-Time — San Francisco, CA
About this Job
As a Twitter Data Scientist, you will use statistical analysis and data mining techniques to better understand how users engage with Twitter, participate in creation and measurement of new and experimental features, and define meaningful success metrics for the whole organization. You should be passionate about finding insights in data and using quantitative analysis to answer complex questions. You should have a strong background in statistics and modeling, machine learning, and working with large datasets.
Responsibilities
Build complex statistical models that learn from and scale to petabytes of data.
Use Map-Reduce frameworks such as Pig and Scalding, statistical software such as R, and scripting languages like Python and Ruby.
Write and interpret complex SQL queries for standard as well as ad hoc data mining purposes.
Communicate findings to product, engineering, and management teams.
Requirements
MS or PhD in Statistics, Math, Engineering, Operations Research, Computer Science, or another quantitative discipline.
Experience with statistical programming environments like R or Matlab.
Experience with scripting languages, regular expressions, etc.
Interest in using discrete math, probability, and statistics to answer complex questions.
Experience in mapping business needs to engineering systems.
Experience with large datasets and map-reduce architectures like Hadoop is a plus.
Desired
Three or more years of industry experience is a plus.
Active user of Twitter.
--
Job Description
The LinkedIn Product team sets strategy, owns their numbers, and builds innovative products that professionals don't just use, but love. They thrive on data and on the opportunity to push the boundaries of web products- all in the pursuit of continued success as the world's largest professional network.
We're looking for a superb analytical mind to expand our team. You should be extremely intelligent, have a quantitative background, and be able to learn quickly and work independently. This is the perfect job for someone who is really smart, driven, and extremely skilled at creatively solving problems. If you have worked with manipulating big data to solve hard problems in creative ways, you will fit right in.
Requirements:
Top notch analytical and algorithm design abilities
Experience in information retrieval and data analytics (e.g. data classification, text mining, search algorithms, etc.)
Must be a hands-on implementer
Strong programming skills: proficiency in scripting languages (e.g. Python) and coding algorithms in Java or C
Proficiency with relational databases (SQL) and non-relational databases (Hadoop/pig)
Experience solving problems with big data
Excellent oral and written communication skills
Must be able to interact with diverse groups of technical and non-technical people
Must be able to communicate effectively to senior executives internally and externally
Above all else must be data driven
--
Job Responsibilities:
(Description)
Autotegrity, the newest addition to the ADP Dealer Services family, is seeking an exceptional Data Scientist professional to be a key addition to our fast-growing team in Cambridge (Kendall Square), MA. Do you enjoy finding opportunities to exploit by digging through large data sets? Are you a hacker who can throw together a system that leverages and automates your findings? Does the idea of being one step ahead of the competition excite you? This person will use their quantitative skills to analyze and manage online traffic acquisition and optimization strategies.
The role is ideal for someone who enjoys finding trends in large datasets and is comfortable working in a highly dynamic environment. You will be able to take action on your findings to see immediate results, adding revenue to the business right away. We use a variety of in-house tools, in addition to off the shelf software such as R, Hadoop, Excel and Pig to help us with our analysis.You will be an integral part of a fast-growing, entrepreneurial team, and you'll have the influence and tools available to help rapidly grow our business. We are looking for someone with analytical ability, an interest in modeling consumer behavior, and the enthusiasm to push us forward.
Much of the repetitive campaign management and reporting work involved in online marketing campaigns is automated with our innovative system. We use truly cutting edge custom-built web analytics software to manage the business. We are not looking for someone to generate reports and PowerPoint presentations. We need smart, quantitative people to rip through data and find trends and nuggets of insight that will help us grow.
Responsibilities
Help drive the strategic direction of the company by identifying opportunities in large, rich data sets.Implement automated systems to exploit opportunities that you discover.Design, implement and analyze tests to measure hypotheses for optimizing problems, ranging from setting bids in the online marketing auction to predicting consumer behavior on our websites.Aid in developing internal tools for data analysis.
Qualifications Required
(Experience, Skills, Academic):
Experience / Requirements
- BS/BA in Computer Science, Math, or another technical major, or demonstrated analytical and technical ability.
- 2+ years work experience preferred.
- Demonstrated experience in AI techniques, semantics, statistical analysis preferred.
- Working knowledge of data mining techniques, including regression analysis, clustering, decision trees, neural networks, SVM.
- Strong data analysis skills (Excel, Matlab/R).
- Strong hacking skills (Python/Ruby/Java).
- Strong foundation in statistics.
- Experience developing predictive models.
- Strong SQL skills.
- Distributed data processing technologies a plus.
- Entrepreneurial mindset.
- Great teamwork skills essential.
- Excellent communication and the ability to communicate technical results to non-technical team members crucial.
Autotegrity is a data analytics and online marketing company that runs an automotive shopping service connecting consumers with auto dealers. We help consumers find the vehicle that is right for them, while also helping dealerships find highly qualified, locally targeted customers. Our rigorous approach to data, along with our proprietary analytics system and search marketing platform are helping us redefine the lead buying process in the automotive space. Cobalt is the largest digital advertising platform provider for automotive industry and is transforming $8 billion dollars in advertising spending from traditional offline media to online advertising and, recently became a division of ADP in Dealer Services. Join ADP Dealer Services and assist in further developing this great digital marketing site that allows our customers to find, service, and sell more products to consumers. This position is a rewarding opportunity to express your creativity, work with intelligent and innovative people, and have your work touch millions of consumers. Cobalt hires all over the US for positions in the field. Automotive retailing solutions continue to be a growth industry as more and more dealerships and manufacturers recognize the operational efficiencies and the extraordinary marketing potential of Cobalt's technology. Benefits include an orca pass for public transportation in Seattle, medical, dental, and vision for employee and dependents, life insurance, paid vacation time, personal days, sick leave and paid holidays, 401K with match, employee referral bonuses, tuition reimbursement, discount programs, real estate services, and more.
Automatic Data Processing, Inc. (NASDAQ: ADP), with about $10 billion in revenues and about 570,000 clients, is one of the world's largest providers of business outsourcing solutions. Leveraging over 60 years of experience, ADP offers a wide range of human resource, payroll, tax and benefits administration solutions from a single source. ADP's easy-to-use solutions for employers provide superior value to companies of all types and sizes. ADP is also a leading provider of integrated computing solutions to auto, truck, motorcycle, marine, recreational vehicle, heavy manufacturing, and agricultural vehicle dealers throughout the world.
--
Primary Job Responsibilities
PayPal is a subsidiary of eBay, the largest online marketplace where hundreds of millions of products are listed daily. With more than 106 million active users, PayPal products touch on web, mobile and social media. Our mobile payment reached more than $4 billion in 2011.
The data science team at PayPal is building next-generation data mining and machine learning systems. Our research and development include recommendation system, text mining, natural language system, user classification, and personalized advertising. We work with structured (transaction), semi-structured (user behavior) and unstructured (text) data. Our data are large-scale and are hosted on Hadoop platform.
We're looking for researchers, research engineers or applied scientists with experience in machine learning, text mining or NLP.
Job Requirements
Our ideal candidate should have strong training in machine learning or data mining, and has experience in text analysis. You will lead the algorithm design and prototyping for analyzing large-scale PayPal data, including text data.
The successful candidate will be a team player, interested in developing scalable solution and quality software.
Education
Doctorate or Equivalent
--
With a mission to power the world's most engaging digital experiences, Bunchball is looking for highly skilled and motivated Data Scientists to help us extract actionable insights from our massive dataset of user activity. You’ll use statistical analysis and data mining techniques to help us better understand how the Nitro platform drives user engagement, determine whether new and experimental features should be launched, and measure Bunchball's success across the entire customer base. You should be passionate about finding insights in data and using quantitative analysis to answer complex questions.
Responsibilities
Leverage data to build insights about our users and customers, helping to understand and shape the Nitro Platform
Work with Product, Client Services and Engineering teams to generate insights, identify trends and opportunities, solve problems, and inform business and product decisions
Analyze and synthesize massive amounts of data and extract useful business insights
Build analytical dashboards and reports to support ongoing business decisions
Skills
MS/PhD/BS in Statistics, Math, Operations Research, Computer Science or another quantitative discipline
Experience with large datasets and distributed computing (Hadoop/Hive)
Strong SQL skills: extensive experience querying large, complex data sets
Proficiency with regular expressions and at least one scripting language (like Python, PHP)
Familiarity with statistical programming environments like R
Interest in using discrete math, probability and statistics to answer complex questions
Excellent communication skills, both oral and written, and a desire to teach and learn
Ability to thrive in a fast-paced, rapidly moving startup environment
Self-motivated, pro-active, curious, responsible and flexible
It's an exciting time at Bunchball with lots to do! We offer a competitive salary, aggressive commissions plan, 401(k), medical, dental and vision coverage. If you think you're the right person for this job, please send a resume to jobs@bunchball.com.
--
A9.com, headquartered in Palo Alto, CA, is a wholly owned subsidiary of Amazon.com. We are dedicated to the continued development of powerful, intelligent and intuitive search and advertising technologies that make it easy to locate products and services.
As a small office, we have a culture of creative problem-solving, intellectual design, fast-paced development and passionate product delivery. Plus, as a subsidiary, we have the longevity and resources of a larger company and the ability to introduce quickly our technology to millions of users.
We are seeking experienced, talented, energetic and self-driven individuals to join our Advertising Technology Platform team. The platform powers fast-growing, state-of-the-art online advertising programs for Amazon.com, other prime online properties and hand-held devices, and it is built with large-scale, distributed web services utilizing the latest technologies.
As a Principal Engineer/Data Scientist, you will:
Lead Ad Technology efforts in inventory forecasting, ad selection, revenue optimization, A/B testing or similar areas
Develop longer-term roadmaps for multiple technology initiatives, encompassing product definition, algorithmic and infrastructure architecture and team composition
Evaluate build-versus-buy
Do hands-on data analysis and modeling with huge data sets to explore alternative designs
Lead matrixed engineering teams on high-value projects, working side-by-side with product managers and software engineers on detailed requirements, technical designs and implementation
Attract PhD- and Master’s-equivalent talent
What you need:
Domain expertise in one or more of the following areas: Information Retrieval, Data Mining, Machine Learning, Algorithm design and analysis and Optimization
Extensive knowledge and expertise in online advertising
Ability to communicate complex technical concepts and solutions to all levels of the organization
Experience technically leading teams to deliver robust and scalable systems
Desire, ability and experience mentoring engineers and managers on advanced technical issues
It is even better if you have:
Experience with inventory forecasting for online advertising
You are going to love this job because you will:
Have a significant and measurable impact on the bottom line
Contribute to strategic products that reach large audiences
Work alongside other top-notch engineers, product managers and business owners on web-scale products and technologies
Have access to the vast technical tools and resources of Amazon.com
--
The Data Scientist role is a thought-leader and is responsible for initiating and overseeing projects that manage, mine, model, deliver, or create data. The tools used by the Data Scientist will include Machine Learning and Statistical Programming and this role is expected to develop new tools. The Data Scientist will be responsible for developing predictive systems, creating efficient algorithms, and improving data quality.
Duties and Responsibilities:
*Develop products through the application of analytical methods on large datasets.
*Provide thought-leadership and dependable execution on diverse projects.
*Innovate ways to improve our data quality and how we manage, mine, model, deliver and create data.
*Work closely within a team structure and develop the abilities of others.
*Identify emergent trends and opportunities for future client growth and development.
*May be requested to interpret findings with clients directly.
Job Role Specific Skills and Competencies:
*Innovative and strong analytical and algorithmic problem-solver.
*Ability to work with large datasets with minimal engineering support.
*Experience with large-scale data analysis and a demonstrated ability to identify key insights from data to solve business problems.
*Ability to build scalable systems that analyze huge data sets and make actionable recommendations.
*Ability to work with incomplete or imperfect data.
*Ability to build and interpret probabilistic models of complex, high-dimensional systems.
*Strong communication skills with the ability to communicate with senior leaders internal and external to the company.
*Understanding of business processes and the utilization of technology to drive those processes.
*Must have the interpersonal skills to interact with technical and non-technical people.
*Collaborative filtering and automatic tree generation methods.
*Clustering techniques (principal components analysis, k-means, etc.).
*Strong programming skills and proficiency in scripting languages (e.g. Python) and coding algorithms in Java, PHP, or C.
*Proficiency with relational databases and new generation analytical tools.
*Analytical and algorithm design abilities with ordinal and nominal data.
Qualifications
Required Skills:
*PHD desired, MS with extensive experience as a Data Analyst on large datasets or research experience in this area preferred. May also consider strong candidates with a BS and relevant work experience.
*Candidates must have strong mathematical and statistical background.
*Semantic text mining or natural language processing experience required.
*Background in data mining, machine learning, statistical analysis and modeling.
*Hadoop or other Map Reduce implementation experience desirable.
*Experience in high-stakes information retrieval and statistical analysis (i.e. Bioinformatics, fraud detection).
*Experience in information retrieval and data analytics (e.g. data classification, text mining, search algorithms, etc).
*Experience working within an agile development process desirable.
*Familiarity with linear algebra and manipulating data using matrix algorithms.
We are looking for candidates with more than just computational skills; candidates must have demonstrated experience where they were able to think “outside of the box” while being responsive to emergent technology, industry, and company trends. Additionally, and ideal candidate would possess the following approaches to their work: hacker (ability to work independently), data analyst (has a demonstrated passion for data), and entrepreneur (has a passion for creating).
About Acxiom
Acxiom is a recognized leader in marketing services and technology that enable marketers to successfully manage audiences, personalize consumer experiences and create profitable customer relationships. Our superior industry-focused, consultative approach combines consumer data and analytics, databases, data integration and consulting solutions for personalized, multichannel marketing strategies. Acxiom leverages over 40 years of experience in data management to deliver high-performance, highly secure, reliable information management services. Founded in 1969, Acxiom is headquartered in Little Rock, Arkansas, USA, and serves clients around the world from locations in the United States, Europe, Asia-Pacific and South America. For more information about Acxiom, visit Acxiom.com.
--
Do you enjoy finding patterns in big data?
Do you want the opportunity to apply the latest AI algorithms to real data?
Do beautiful data visualizations get you excited?
Do you want to have millions of people use your code every day?
Do you consider yourself an innovator? Tinkerer? Dreamer?
Come be a part of Trulia’s new Data Science Lab! You’ll have the opportunity to work with massive datasets--think trillions of user actions and millions of homes. You’ll get to work on (and help build) a team that loves AI technologies, and can’t get enough of data visualizations. You’ll get to propose projects, and see them through the whole life cycle. Interested yet?
Experience / Skills:
o Java, Hadoop experience required, experience with R, JavaScript is a bonus
o Experience with (or a strong desire to use) the latest AI, machine learning, data mining, and network science algorithms
o Appreciation of good design and beautiful products
o Desire to stay abreast of the latest research in data mining
Responsibilities:
o Explore massive amounts of user behavior data and real estate content data
o Brainstorm ideas for tools that communicate the most valuable insights of that data to Trulia users and real estate agents
o Engineer prototypes
o Market prototypes within the company
o Scale up prototypes to production
Education:
o BA, BS, MS, or PhD in Computer Science, Data Mining, Statistics, or Applied Mathematics
Trulia is the innovative leader in the online real estate industry. We are a successful company and expanding rapidly. This is a great opportunity to join a team of smart and passionate people with a vision to help millions of users make smarter decisions about real estate whether they are renting, buying or already own a home.
@microprediction
Copy link

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment