Home Blog Page 84

AI applications in clinical settings reveal major limitations

  • Researchers design a more realistic test to evaluate AI’s clinical communication skills.
  • Enhancing AI models like CRAFT-MD can forge a path toward integrating these technologies into clinical practice in an effective and ethical manner.

The advent of artificial intelligence (AI) tools, particularly large language models such as ChatGPT, has generated considerable optimism regarding their potential to alleviate clinician workload in healthcare.

These models are envisioned to streamline processes by triaging patients, collecting medical histories, and even providing preliminary diagnoses.

However, despite their impressive performance on standardised medical assessments, recent research conducted by Harvard Medical School and Stanford University reveals significant shortcomings when these models are subjected to scenarios that more accurately reflect real-world clinical interactions.

The study by researchers at Harvard Medical School and Stanford University introduced an evaluation framework known as the Conversational Reasoning Assessment Framework for Testing in Medicine (CRAFT-MD).

The framework was designed to assess the performance of four large language models in simulated clinical conversations, thereby providing a more realistic measure of their capabilities.

While these models excelled in answering questions akin to those found on medical board exams, their performance deteriorated markedly when engaged in dynamic, conversational exchanges that mirror actual patient-clinician interactions.

CRAFT-MD framework

The findings highlight a critical gap in the evaluation of AI models for clinical use. The traditional method of assessing AI performance through multiple-choice questions fails to capture the complexities inherent in real-world medical dialogues.

As noted by study co-first author Shreya Johri, this approach presupposes that all relevant information is presented in a clear and concise manner, a scenario rarely encountered in actual clinical practice.

“The CRAFT-MD framework aims to rectify this by evaluating the models’ abilities to gather information about symptoms, medications, and family history in a conversational context, thereby enhancing the realism of the assessment.”

The study’s senior author, Pranav Rajpurkar, emphasises the paradox that while AI models may perform admirably on standardised tests, they struggle with the nuanced, back-and-forth interactions typical of a doctor’s visit.

“The dynamic nature of medical conversations—necessitating timely inquiries, the synthesis of disparate information, and reasoning through symptoms—presents challenges that extend beyond mere factual recall or multiple-choice responses.”

The researchers used CRAFT-MD to test four AI models — both proprietary and commercial and open-source ones — for performance in 2,000 clinical vignettes featuring conditions common in primary care and across 12 medical specialties. 

The applications of AI in clinical settings reveal significant limitations, particularly in conducting nuanced medical conversations. These limitations manifest in the models’ inability to coherently take medical histories and formulate accurate diagnoses.

Targeted recommendations

A closer examination indicates that AI systems frequently falter in eliciting relevant patient information through appropriate questioning, inadvertently neglecting vital clues that could inform clinical decisions. Furthermore, these models demonstrate a decline in performance when tasked with analysing open-ended information as opposed to structured multiple-choice inputs.

The propensity for error is exacerbated in dynamic, back-and-forth dialogues, which better resemble the complexity of actual patient interactions.

To enhance the utility of AI in real-world medical scenarios, a set of targeted recommendations is proposed for both developers and regulators of AI technologies.

First and foremost, training should prioritise the use of conversational, open-ended questions that accurately reflect the organic exchanges observed in doctor-patient interactions. Furthermore, there is an imperative need to rigorously assess the ability of these models to ask pertinent questions and efficiently extract critical information from patients.

In addition, AI systems should be designed to juggle multiple conversational threads, synthesizing and integrating disparate information from various interactions.

Enhancing AI models

Importantly, the capacity to unify both textual notes and non-textual data—such as imagery or electrocardiograms (EKGs)—will significantly enhance diagnostic accuracy. Incorporating capabilities to interpret non-verbal cues, including facial expressions, tone, and body language, would further elevate the effectiveness of these AI agents.

Moreover, the standard evaluation process must shift to encompass both AI models and human experts. Relying solely on human evaluators is resource-intensive and costly.

For instance, the CRAFT-MD model demonstrated remarkable efficiency by processing 10,000 conversations within 48 to 72 hours, a task that would otherwise demand extensive human resources, spanning hundreds of hours for simulations and evaluations. Deploying AI for preliminary evaluations not only conserves human capital but also mitigates the risks associated with trials involving real patients and unverified AI tools.

Ultimately, the commitment to enhancing AI models like CRAFT-MD can forge a path toward integrating these technologies into clinical practice in an effective and ethical manner.

As noted by Roxana Daneshjou, a prominent figure in the realm of biomedical data science, adopting frameworks that closely mirror genuine healthcare interactions will catalyse the progress of AI model testing and performance in the healthcare landscape. In doing so, the potential for AI to augment clinical practice may be actualised, paving the way for improved patient outcomes across medical disciplines.

Related Posts:

Researchers train solar panels to dance with the wind

  • Innovative approach not only minimises stress on the panels but also preserves energy production during adverse weather.
  • By integrating advanced fluid dynamics with artificial intelligence, the researchers propose a novel approach to mitigate the risks associated with high winds.

Wind possesses a dual nature in its relationship with solar power grids, imparting both beneficial and detrimental effects.

On one hand, wind plays a crucial role in enhancing the performance of solar panels. It facilitates the removal of dirt and dust accumulation, which can obstruct sunlight and diminish energy output.

Furthermore, as solar panels tend to lose efficiency when subjected to elevated temperatures, the cooling effect of wind can significantly enhance their operational efficacy.

Rise in insurance claims

The interplay between wind and solar energy exemplifies the potential for natural elements to bolster renewable energy systems.

Conversely, the fragility of solar panels renders them susceptible to high-wind events, which can lead to structural failures and necessitate extensive repair efforts.

The increasing prevalence of severe weather has resulted in a notable rise in insurance claims related to photovoltaic panel damage, underscoring the vulnerabilities associated with their deployment.

As solar power emerges as the fastest-growing energy sector globally, the need to address these risks becomes paramount. The potential of solar photovoltaic plants to contribute to the Net Zero Emissions by 2050 initiative is significant; however, this potential is compromised by the challenges posed by extreme weather conditions.

Recent research published in Physics of Fluids by a team at the Centre for Material Forming at PLS University in Sophia Antipolis, France, introduces an innovative numerical decision-making framework aimed at enhancing solar panel resilience against wind-related damage.

Safeguarding solar installations

By integrating advanced fluid dynamics with artificial intelligence, the researchers propose a novel approach to mitigate the risks associated with high winds.

As Elie Hachem, one of the authors, articulates, this framework represents an opportunity to rethink traditional methods of safeguarding solar installations.

Historically, efforts to protect solar panels have centred on adjusting row spacing, ground clearance, and tilt angles. Current tracking mounts, which optimise solar exposure by rotating panels, often revert to a stowed position during high winds, sacrificing energy output and failing to provide adequate protection against severe gusts.

In contrast, the new framework treats solar panels as independent decision-makers, capable of adapting their angles based on real-time wind conditions. The innovative approach not only minimises stress on the panels but also preserves energy production during adverse weather.

US and China renew limited pact in scientific and technological fields

  • New accord includes provisions for dispute resolution and termination clauses, ensuring that either party may withdraw should the other violate the terms of the agreement.

The renewal of the Science and Technology Cooperation Agreement between the United States and China marks a significant development in the context of the increasingly complex geopolitical landscape.

Signed on Friday, the updated accord enables limited government cooperation in scientific and technological fields, simultaneously maintaining a channel of dialogue amidst prevailing tensions concerning national security and trade.

The agreement, which has evolved from its original inception in 1979, now embodies modern demands for transparency and accountability.

US negotiators approached the renewal with a keen eye on safeguarding national interests, incorporating specific mechanisms designed to enhance transparency and enforce compliance. According to senior officials from the State Department, these adjustments reflect an acute awareness of the strategic context in which this cooperation occurs, acknowledging the persistent strains characterising US-China relations.

The exclusion of critical and emerging technologies from the agreement further underscores a cautious approach, prioritising American security considerations while allowing for limited collaboration in less sensitive areas.

Mutual accountability

Notably, the new accord includes provisions for dispute resolution and termination clauses, ensuring that either party may withdraw should the other violate the terms of the agreement.

Such measures indicate a commitment to a framework of mutual accountability, a crucial element given the history of intellectual property concerns linked to Chinese practices.

The anticipatory measures included in the updated agreement serve to mitigate the risk of unilateral data sharing, striving for reciprocity in collaboration moving forward.

As underscored by the ongoing review processes by both the State Department and the White House, any proposals for collaboration with China will undergo rigorous scrutiny. The structured oversight aims to reinforce US interests while recognising the necessity of maintaining some degree of scientific partnership, given the potential adverse effects of a complete cessation of cooperation.

US to grant $225m in chip subsidies to Bosch

  • Facility projected to account for over 40% of all US SiC device manufacturing capacity once fully operational by 2026.

The announcement from the US Commerce Department regarding a preliminary agreement with German auto supplier Bosch signifies a pivotal moment in the American semiconductor landscape, particularly in the burgeoning field of electric vehicles (EVs).

The accord involves up to $225 million in subsidies that will facilitate Bosch’s ambitious plan to invest $1.9 billion in transforming its manufacturing operations in Roseville, California, for the production of silicon carbide (SiC) power semiconductors.

Silicon carbide semiconductors are integral to the automotive, telecommunications, and defense industries, primarily due to their energy efficiency and capability to enhance both the driving and charging performance of electric vehicles.

Related Posts:

By supporting Bosch’s endeavor, the Commerce Department is not only bolstering domestic semiconductor production but is also addressing the critical supply chain vulnerabilities that were laid bare during the COVID-19 pandemic.

The disruptions in semiconductor manufacturing in Asia had severely affected auto manufacturers, underscoring the necessity for a more resilient and localized production base.

The funding plan is part of a broader initiative established in 2022, which allocated a substantial $52.7 billion fund aimed at boosting US semiconductor production and research.

Funding opportunities

The timely execution of this initiative is paramount, especially with the impending transition of power in Washington and the urgent need to secure critical funding opportunities for American manufacturers.

Bosch’s acquisition of TSI Semiconductors’ assets in California and its projection to commence SiC chip production by 2026 illustrates a strategic pivot towards enhancing domestic capabilities in semiconductor manufacturing.

The potential that this facility holds—projected to account for over 40 per cent of all US SiC device manufacturing capacity once fully operational—could significantly bolster the nation’s position in advanced technology sectors.

The implications of this investment extend beyond mere production metrics. As noted by Paul Thomas, president of Bosch in North America, the investment in Roseville fosters local production of essential components for the transition to electrification, aligning with the broader objectives of clean energy and sustainability.

Representative Doris Matsui’s endorsement further emphasises the importance of this initiative for California and the nation, marking a significant step towards advancing clean mobility and electric vehicle technology.

Will emergence of Android XR be a new era for VR and AR?

  • Gap between theoretical advantages of these technologies and their practical implementation can lead to hesitance among consumers, who may prefer to refrain from investing in products that do not yet demonstrate clear benefits.
  • One factor contributing to consumer scepticism is the disparity between advanced technology and practical application.

Google, in collaboration with Samsung and Qualcomm, unveiled its latest innovation, the Android XR operating system, designed to enhance virtual and augmented reality (AR/VR) experiences and the first platform built entirely for the Gemini era.

Samsung’s Project Moohan, the first device powered by Android XR, exemplifies this new direction. This virtual reality headset, featuring mixed reality capabilities through external cameras, combines elements from both Apple’s Vision Pro and Meta’s Quest 3.

The user interface of Android XR resembles existing AR/VR systems, allowing users to arrange app windows in a virtual environment while maintaining awareness of their physical surroundings.

Notably, the integration of Google’s Circle to Search technology enables users to interact with real-world objects and access information seamlessly.

The potential of Android XR extends beyond headsets. Google’s ambitions for augmented reality glasses are particularly noteworthy, especially given the mixed success of its previous endeavor, Google Glass.

The new glasses promise to offer users real-time information and navigation through voice commands and an in-lens display, leveraging the capabilities of Gemini AI.

A niche segment

The advancement could redefine how users interact with their environment, providing contextual information with unprecedented ease.

In recent years, the infusion of artificial intelligence into augmented reality (AR) and virtual reality (VR) technologies has generated considerable optimism within the tech community.

Google, a leader in innovation, is at the forefront of this trend, particularly with its ambitions in developing headsets and smart glasses that seamlessly integrate AI functionalities.

However, despite this enthusiasm, there remains a palpable scepticism among consumers that cannot be overlooked. The prevailing sentiment suggests that the current demand for AR and VR technologies has notably declined, as evidenced by recent shipment data.

While augmented reality (AR) and virtual reality (VR) headsets are gradually gaining traction, they still occupy a niche segment of the technology market.

According to International Data Corporation (IDC), the industry is projected to ship approximately 6.7 million AR/VR units in 2024, with an anticipated growth to 22.9 million units by 2028.

However, this figure pales in comparison to the staggering 316.1 million smartphones shipped globally in just the third quarter of 2024.

One factor contributing to consumer scepticism is the disparity between advanced technology and practical application. While Google’s AI-forward approach promises a more integrated and user-centric experience, potential customers often question the immediate utility and relevance of such innovations in their daily lives.

The gap between the theoretical advantages of these technologies and their practical implementation can lead to hesitance among consumers, who may prefer to refrain from investing in products that do not yet demonstrate clear benefits.

Moreover, the broader economic landscape has compounded these issues, as economic uncertainty has made consumers more discerning about their expenditures.

Lingering complexities

The decline in demand for AR and VR is indicative of a market still maturing, where early adopters have yet to be convinced of the value proposition that lies in advanced headsets and smart glasses.

These devices must overcome not only technological barriers but also psychological ones, as consumers grapple with the potential obsolescence of current offerings and the lingering complexities that accompany emerging technologies.

The potential for smart glasses to capture the average consumer’s interest remains uncertain. Smart glasses, in particular, represent a relatively untested technology.

Companies like Meta have ventured into this space with products such as the Ray-Ban Meta glasses, which feature built-in cameras and an AI assistant. Despite claims of strong sales, Meta has not disclosed specific figures, leaving the market’s reception ambiguous.

Ultimately, the future of smart glasses hinges on consumer demand, which will become clearer as Android XR headsets and glasses enter the market.

The industry’s ability to gauge genuine interest in high-tech eyewear will depend on consumer engagement and the extent to which these devices can seamlessly integrate into everyday life.

As of now, the appetite for smart glasses among average consumers remains an open question, one that the market must explore further in the coming years.

Oasis Security finds vulnerability within Microsoft’s MFA implementation

  • Researchers say that achieving a 50% probability of success would only require approximately 24 login sessions, taking about 70 minutes to execute.
  • Crux of the vulnerability lies in the lack of rate limiting associated with MFA attempts.
  • Lack of notification renders the vulnerability particularly insidious, as it allows for prolonged and undetected exploitation.

Microsoft, a leading proponent of multifactor authentication (MFA), has long championed its effectiveness in safeguarding user accounts. The company asserts that accounts utilising MFA are over 99 per cent less likely to be compromised.

However, a recent report by Oasis Security has unveiled a significant vulnerability within Microsoft’s MFA implementation, particularly affecting services such as Outlook, OneDrive, Teams, and Azure Cloud.

The oversight raises serious concerns about the security of millions of Office 365 accounts and highlights the need for continuous vigilance in cybersecurity practices.

Microsoft has more than 400 million paid Office 365 users, making the consequences of this vulnerability far-reaching. 

The crux of the vulnerability lies in the lack of rate limiting associated with MFA attempts. Researchers from Oasis discovered that once a user initiates a login session, they are granted a session identifier that permits up to ten consecutive failed attempts to enter the six-digit MFA code.

No restrictions

Alarmingly, there are no restrictions on the number of new login sessions that can be initiated. This loophole allows potential attackers to engage in what is known as “MFA code spraying,” where they can repeatedly guess authentication codes without triggering any alerts or notifications for account holders.

According to the report, the attack is alarmingly straightforward. An attacker with access to a user’s password—often obtained from infostealer logs available on the dark web—can exploit this vulnerability to make an extensive number of attempts at guessing the MFA code.

The researchers noted that the window for entering the correct code was extended to approximately three minutes, allowing for an increased number of attempts beyond the typical thirty-second code generation cycle.

The extended timeframe significantly enhances the likelihood of a successful attack, with a three per cent chance of guessing the correct code within the extended period.

Moreover, the researchers found that achieving a 50 per cent probability of success would only require approximately 24 login sessions, taking about 70 minutes to execute. Throughout this process, account holders remained oblivious to the numerous failed login attempts, as no alerts were triggered.

A critical component

The lack of notification renders the vulnerability particularly insidious, as it allows for prolonged and undetected exploitation.

In response to the responsible disclosure by Oasis Security on June 24, 2024, Microsoft implemented a temporary fix shortly thereafter, and a permanent solution was established by October 9, 2024.

The updated security measures now include a much stricter rate limiting protocol that activates after a specified number of failed attempts, effectively mitigating the risk of such attacks.

While the implementation of MFA remains a critical component of cybersecurity best practices, this incident underscores the importance of robust security measures and the necessity for continuous improvement. Users must remain vigilant and adopt stronger authentication methods, including passwordless solutions, to enhance their account security.