Bringing Ancient Art to Life: Breathing Life into Shanshui Art with AI and Perlin Noise

When we think of traditional Chinese Shanshui (mountain-water) paintings, we imagine serene, timeless scenes of misty mountains and flowing rivers—a harmony between nature and artistic expression that has been refined over centuries. But what if these tranquil landscapes could come to life?
Bringing Ancient Art to Life: Breathing Life into Shanshui Art with AI and Perlin Noise
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

SpringerLink
SpringerLink SpringerLink

Generative AI Shanshui animation enhancement using Perlin noise and diffusion models - Discover Artificial Intelligence

Deep learning models have achieved remarkable advancements in image generation but face persistent challenges in synthesizing traditional Shanshui (mountain-water) landscape paintings due to limited domain-specific training data and the complexity of aesthetic principles. This study integrated Perlin Noise, Stable Diffusion, ControlNet, and AnimateDiff to enhance Shanshui landscape generation and animation. Perlin Noise constructs naturalistic skeletal structures, which are further refined using ControlNet for precise structural control. Advanced prompt engineering with GPT-4 and Textual Inversion improved prompt descriptiveness and mitigated low-quality outputs. Furthermore, LoRA fine-tuning improved the adaptability of our Shanshui landscapes model. Integrating I2V Encoders and AnimateDiff enabled the seamless transformation of static landscape images into dynamic animations, preserving artistic authenticity while introducing motion consistency. The experimental results demonstrated significant improvements in realism, stylistic fidelity, and diversity, addressing key limitations in existing generative approaches. This framework not only advances the field of generative AI in digital art but also offers new opportunities for the creation of multimedia content and cultural preservation through the synthesis of computational Shanshui animation.

In our recent paper, Generative AI Shanshui Animation Enhancement using Perlin Noise and Diffusion Models,” we set out to explore exactly that: using modern generative AI to animate classical Shanshui art without losing its soul.

Performance of the animation generated by AnimateDiff with Motion Module
Performance of the animation generated by AnimateDiff with our Motion Module

The Challenge: Preserving Art in the Age of AI

Generative AI has made incredible strides in image and video synthesis, but traditional art forms like Shanshui painting remain a tough nut to crack. The main hurdles are:

  • Limited training data: There aren’t enough high-quality, digitized Shanshui paintings to train a model from scratch.

  • Aesthetic complexity: Shanshui isn’t just about shapes—it’s about composition, brushstroke style, mood, and cultural nuance.

Simply fine-tuning a diffusion model on a few Shanshui images wasn’t enough. We needed a way to guide the AI to understand the structure and spirit of the art, not just mimic it.

Our Approach: A Hybrid Creative Pipeline

We built a modular system that combines several AI techniques into a coherent creative workflow:

1. Generating the Skeleton with Perlin Noise

Instead of starting from noise or random latent vectors, we used Perlin Noise—a classic computer graphics algorithm—to generate the foundational “skeleton” of the landscape. Perlin Noise gives us natural-looking, continuous variations that mimic the organic flow of ink and brushwork. Mountains, ridges, and water paths emerge in a way that already feels artistic, not algorithmic.

2. Guiding Diffusion with ControlNet and GPT-4

We then used Stable Diffusion paired with ControlNet to “fill in” the skeleton with style, color, and detail. ControlNet ensured the generated structure stayed true to the original sketch, while GPT-4 helped generate rich, descriptive prompts that captured the essence of Shanshui—terms like “misty mountains,” “flowing river,” “distant pine trees,” and “soft ink wash.”

This prompt engineering step was crucial. It allowed us to steer the diffusion model toward artistic authenticity without needing thousands of training examples.

3. From Image to Animation with AnimateDiff

Here’s where the magic happens: turning a static painting into a living animation. We developed an Image-to-Video (I2V) Encoder that prepares the generated landscape for AnimateDiff, a diffusion-based video generation model. By introducing controlled noise and temporal dynamics, we created smooth, coherent motion—clouds drifting, water flowing, leaves rustling—all while preserving the painting’s style.

4. Refining with Textual Inversion and LoRA

To further enhance quality, we used Textual Inversion to teach the model what not to generate (e.g., “blurry,” “oversaturated”), and experimented with LoRA fine-tuning to adapt the model more closely to Shanshui aesthetics. Interestingly, we found that a well-designed Perlin Noise backbone often outperformed LoRA in maintaining structural integrity and stylistic purity.

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Go to the profile of Anna Keen
4 months ago

Hello I am an artist (landscape oil painter). I like the way you have animated these paintings.  I would like to email you for further discussion.

Go to the profile of Kanoksak Wattanachote
4 months ago

Welcome

Follow the Topic

Art History
Humanities and Social Sciences > Arts > Art History
Computer Imaging, Vision, Pattern Recognition and Graphics
Mathematics and Computing > Computer Science > Computer Imaging, Vision, Pattern Recognition and Graphics
Artificial Intelligence
Mathematics and Computing > Computer Science > Artificial Intelligence
Artificial Photosynthesis
Physical Sciences > Chemistry > Organic Chemistry > Green Chemistry > Artificial Photosynthesis
Animation
Humanities and Social Sciences > Media and Communication > Film and Television Studies > Animation
Photosynthesis
Life Sciences > Biological Sciences > Plant Science > Photosynthesis

Related Collections

With Collections, you can get published faster and increase your visibility.

Transforming Education through Artificial Intelligence: Opportunities, Challenges, and Future Directions

Artificial Intelligence (AI) is rapidly changing the educational field by enabling personalized learning, intelligent tutoring systems, automated assessments, learning analytics, and administrative automation.

This collection invites original research, systematic reviews, and visionary perspectives on the transformative impact of AI in education. It aims to explore how AI technologies can enhance equity, inclusion, and efficiency in educational settings across different contexts, including higher education, K-12, vocational training, and lifelong learning. This collection will address technical, pedagogical, ethical, and policy aspects, fostering interdisciplinary perspectives and evidence-based insights.

This Collection supports and amplifies research related to SDG 4 and SDG 9.

Keywords: Artificial Intelligence, AI in Education, Educational Technology, Data Analytics, AI Ethics

Publishing Model: Open Access

Deadline: Nov 30, 2026

Artificial Intelligence in Medical Imaging

This Topical Collection focuses on artificial intelligence (AI) in medical imaging, which aims to highlight recent advancements in the field of medical imaging analysis using AI and big data. Medical imaging is an essential tool for diagnosis, treatment, and monitoring of various medical conditions. However, analyzing medical images can be time-consuming, costly, and prone to human error. With the emergence of AI, many of these challenges can be addressed by automating tasks involved in medical imaging analysis.

We welcome submissions on various topics related to AI in medical imaging, including, but not limited to, novel AI algorithms and techniques for medical image analysis, the integration of AI into clinical workflows, the development of software packages for medical imaging analysis, and the evaluation of AI methods for clinical use. Additionally, we encourage submissions that explore the ethical and social implications of AI in medical imaging, such as the impact on patient privacy, data security, and clinical decision-making.

Overall, this Topical Collection aims to provide a comprehensive overview of the recent advancements in AI in medical imaging and to promote interdisciplinary research and collaborations between AI researchers, medical imaging experts, and clinicians.

Keywords: Clinical Decision Support System; Computer-Aided Diagnosis; Computer Vision; Deep Learning; Diagnostic Imaging; Image Classification; Image Processing; Image Segmentation; Object Detection; Precision Medicine; Radiomics

Publishing Model: Open Access

Deadline: Aug 10, 2026