TechTalks Newsletter

Share this post
A dataset to automate programming tasks
bdtechtalks.substack.com

A dataset to automate programming tasks

Ben Dickson
May 17, 2021
Comment
Share

Last week, IBM Research released Project CodeNet, a programming dataset with 14 million samples. CodeNet is meant to train machine learning models that automate programming tasks.

While machine learning is nowhere near replacing programmers, it can become the basis for many tools that can make programmers more productive. The dataset is very well annotated and can be used to develop different kinds of ML models. Some potential uses for CodeNet include the following:

  • Translation between different programming languages

  • Advanced recommendation and autocomplete

  • Code optimization

  • Code generation

Read the full story on TechTalks.

CommentComment
ShareShare

Create your profile

0 subscriptions will be displayed on your profile (edit)

Skip for now

Only paid subscribers can comment on this post

Already a paid subscriber? Sign in

Check your email

For your security, we need to re-authenticate you.

Click the link we sent to , or click here to sign in.

TopNewCommunity

No posts

Ready for more?

© 2022 Ben Dickson
Privacy ∙ Terms ∙ Collection notice
Publish on Substack Get the app
Substack is the home for great writing