NewzTech.net

DeepMind Researchers Propose Normalizer-Free ResNets (NFNets) To Achieve Large-Scale Image-Recognition Without Batch Normalization

February 22, 2021
in AI

Source: https://arxiv.org/pdf/2102.06171.pdf

A team of researchers at DeepMind has introduced Normalizer-Free ResNets (NFNets), demonstrating that large-scale image recognition models can be trained without batch normalization layers. The researchers present a new gradient clipping algorithm and use it to design models that match, and in some cases outperform, the best batch-normalized classification models on large-scale datasets while significantly reducing training time.

Batch normalization and its shortcomings

Batch normalization is a key component of most image classification models. It accelerates training, enables higher learning rates, improves generalization accuracy, and has a regularizing effect. However, it suffers from three practical disadvantages:

  • It is costly in memory and time.
  • It introduces discrepancies between model behaviors during training and inference time, thereby requiring additional fine-tuning.
  • It destroys the independence between training examples in the minibatch.
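The second and third points can be seen in a few lines of code. The sketch below is a simplified, single-feature batch normalization written for illustration only (no learned scale or shift, and the function names are hypothetical). It shows that the training-time output for one example depends on the other examples in its batch, and that inference uses stored statistics instead of batch statistics.

```python
import math

def batch_norm_train(batch, eps=1e-5):
    """Normalize a list of scalars with the statistics of this batch."""
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)
    return [(x - mean) / math.sqrt(var + eps) for x in batch]

def batch_norm_eval(x, running_mean, running_var, eps=1e-5):
    """Normalize one scalar with stored (running) statistics."""
    return (x - running_mean) / math.sqrt(running_var + eps)

# The same input value (1.0) appears in two different batches.
out_a = batch_norm_train([1.0, 2.0, 3.0])
out_b = batch_norm_train([1.0, 1.0, 4.0])

# Its normalized value differs because its batch mates differ.
print(out_a[0], out_b[0])

# At inference time the result depends only on the stored statistics.
print(batch_norm_eval(1.0, running_mean=0.0, running_var=1.0))
```

Removing the normalization layer removes both effects, which is part of the motivation for normalizer-free networks.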

Many recent studies have successfully trained deep ResNets without normalization. However, the resulting models do not match the test accuracy of SOTA batch-normalized networks and are frequently unstable under strong data augmentations or large learning rates.

Novel Normalizer-Free networks

To address these weaknesses, the DeepMind team designed a family of Normalizer-Free ResNets (NFNets). NFNets can be trained with larger batch sizes and stronger data augmentations, and they set new SOTA validation accuracies on ImageNet.

To train NFNets with larger batch sizes and stronger data augmentations, the team employed Adaptive Gradient Clipping (AGC). AGC clips gradients based on the unit-wise ratio of gradient norms to parameter norms: when the gradient of a unit is large relative to that unit's weights, the gradient is scaled down.
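A minimal sketch of this idea is shown below, in plain Python rather than the JAX used in the authors' repository. It treats each row of a weight matrix as one unit. The function names, the default threshold `clip`, and the floor `eps` on the weight norm are illustrative assumptions, not values taken from this article.

```python
import math

def l2_norm(row):
    """Euclidean norm of one unit's weights or gradients."""
    return math.sqrt(sum(x * x for x in row))

def adaptive_grad_clip(weights, grads, clip=0.01, eps=1e-3):
    """Rescale each gradient row so that ||g_i|| / ||w_i|| <= clip.

    `weights` and `grads` are lists of rows; each row is one unit.
    `eps` floors the weight norm so units with zero weights can still move.
    """
    clipped = []
    for w_row, g_row in zip(weights, grads):
        w_norm = max(l2_norm(w_row), eps)
        g_norm = l2_norm(g_row)
        max_norm = clip * w_norm
        if g_norm > max_norm:
            scale = max_norm / g_norm
            clipped.append([g * scale for g in g_row])
        else:
            clipped.append(list(g_row))  # small gradients pass through unchanged
    return clipped

weights = [[3.0, 4.0], [1.0, 0.0]]      # unit norms: 5.0 and 1.0
grads = [[0.3, 0.4], [0.001, 0.0]]      # gradient norms: 0.5 and 0.001
result = adaptive_grad_clip(weights, grads, clip=0.05)
print(result)
```

In this example the first unit's ratio is 0.5 / 5.0 = 0.1, above the threshold of 0.05, so its gradient is halved. The second unit's ratio is 0.001, so its gradient is left as it is. Unlike clipping by a fixed global norm, the allowed gradient size here scales with the size of the weights it updates.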

To evaluate the effect of AGC, the researchers ran a range of ablations comparing batch-normalized ResNets to NF-ResNets with and without AGC. The results show that AGC allows NF-ResNets to scale efficiently to larger batch sizes.


Building on AGC, the team designed a family of Normalizer-Free architectures (NFNets). They started from an SE-ResNeXt-D model, a strong baseline for Normalizer-Free networks, then revised its width and depth patterns and added a second spatial convolution. Lastly, they applied AGC to every parameter except for the linear weight of the classifier layer.
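Excluding one parameter from clipping can be sketched as a simple filter over named gradients. Everything in this example is hypothetical: the parameter names, the `clip_all_but` helper, and the `cap_at_one` rule, which is only a stand-in for a real clipping function such as AGC.

```python
def clip_all_but(grads, clip_fn, skip=("classifier/w",)):
    """Apply `clip_fn` to every named gradient except those listed in `skip`."""
    return {
        name: list(grad) if name in skip else clip_fn(grad)
        for name, grad in grads.items()
    }

def cap_at_one(grad):
    """Stand-in clipping rule for the demo: cap each entry at magnitude 1.0."""
    return [max(-1.0, min(1.0, g)) for g in grad]

grads = {
    "conv1/w": [2.0, -3.0],
    "classifier/w": [5.0, -7.0],   # linear weight of the classifier: left unclipped
    "classifier/b": [4.0],
}
clipped = clip_all_but(grads, cap_at_one)
print(clipped)
```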

The researchers compared the accuracy of NFNets on the ImageNet dataset with a set of standard models, including SENet (Hu et al., 2018), LambdaNet (Bello, 2021), BoTNet (Srinivas et al., 2021), and DeiT (Touvron et al., 2020). They also tested NFNets in the transfer learning regime, pre-training on a dataset of 300 million labeled images to validate the suitability of Normalizer-Free networks for transfer learning after large-scale pre-training.

The NFNet-F5 model achieved a top-1 validation accuracy of 86.0 percent on ImageNet, outperforming the former state-of-the-art model, EfficientNet-B8. The researchers also show that Normalizer-Free models outperform their batch-normalized counterparts when fine-tuned on ImageNet after large-scale pre-training, reaching an accuracy of 89.2 percent.

Paper: https://arxiv.org/pdf/2102.06171.pdf

Github: https://github.com/deepmind/deepmind-research/tree/master/nfnets
