<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Shahrood University of Technology</PublisherName>
				<JournalTitle>Journal of AI and Data Mining</JournalTitle>
				<Issn>2322-5211</Issn>
				<Volume></Volume>
				<Issue>Articles in Press</Issue>
				<PubDate PubStatus="epublish">
					<Year>2026</Year>
					<Month>02</Month>
					<Day>10</Day>
				</PubDate>
			</Journal>
<ArticleTitle>Skeleton-Based Sign Language Generation Using a Transformer-based Generative Model</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage></FirstPage>
			<LastPage></LastPage>
			<ELocationID EIdType="pii">3723</ELocationID>
			
<ELocationID EIdType="doi">10.22044/jadm.2025.16369.2759</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Rozhin</FirstName>
					<LastName>Mohammadizand</LastName>
<Affiliation>Electrical and Computer Engineering Department, Semnan University, Semnan, Iran</Affiliation>

</Author>
<Author>
					<FirstName>Razieh</FirstName>
					<LastName>Rastgoo</LastName>
<Affiliation>Electrical and Computer Engineering Department, Semnan University, Semnan, Iran</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2025</Year>
					<Month>06</Month>
					<Day>07</Day>
				</PubDate>
			</History>
		<Abstract>Sign language is a structured, non-vocal form of communication primarily used by individuals who are deaf or hard of hearing, who often face challenges interacting with non-signers. To address this, translation systems between sign and spoken language are essential, encompassing sign language recognition and production. In this work, we focus on sign language production and propose a deep learning framework for generating skeleton-based video representations of sign language at the word level. Our approach employs a conditional Generative Adversarial Network (cGAN) with transformer embeddings in both generator and discriminator, augmented with bone-length and joint-angle constraints and a classifier-guided loss to ensure anatomically plausible and semantically consistent gestures. We further introduce a novel loss function to improve human keypoint generation for sign representation. Extensive experiments on three benchmark datasets demonstrate that our method outperforms state-of-the-art approaches according to statistical (MMD) and perceptual (FID) metrics, while qualitative analyses confirm that the generated gestures are temporally smooth, anatomically accurate, and semantically meaningful. These results highlight the effectiveness of our model in advancing word-level sign language synthesis.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Sign Language Generation</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Skeleton</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Transformer</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Generative Adversarial Network (GAN)</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Sign Language Recognition</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://jad.shahroodut.ac.ir/article_3723_5254dffd32aab5b973535bcbd43eee44.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
